THIS WEE 


OCEANOGRAPHY What the 
good ship Google means 
for science p.410 


EDITORIALS 


WORLD VIEW Risky virus » DIsguIsE How the 
research must be halted 390-million-year-old 
then discussed p.411 trilobite got its spots p.412 


Under the gun 


Abanon advocacy and promotion of gun control is keeping US agencies from conducting research 
that is sorely needed to inform policy on firearms and prevent shootings. 


bill that will fund the US government for the rest of the 2013 

fiscal year. The bill contains the first congressional action on 
gun control since 20 children and six staff members were gunned 
down at Sandy Hook Elementary School in Newtown, Connecticut, 
on 14 December. But the congressional action is notable for the wrong 
reasons. Far from promising to shine badly needed light on the myriad 
unanswered questions around current policies on firearms, it instead 
continues restrictions on the gun-related research that is essential to 
wise policy-making. It is precisely the opposite of what is needed in 
the United States, where people murder each other with guns at about 
20 times the rate in other wealthy nations. 

We have been here before. After a gunman murdered 12 people 
and injured 58 in a Colorado cinema last July, this publication noted 
that the US National Rifle Association (NRA), through proxies in 
Congress, has since 1996 inserted restrictive language in the annual 
bill that funds the Centers for Disease Control and Prevention (CDC). 
The language prohibits the agency from spending any money “to advo- 
cate or promote gun control” (see Nature 488, 129; 2012). 

In 2012, that language was extended to include all of the CDC’s huge 
parent department, the Department of Health and Human Services, 
which includes the National Institutes of Health (NIH). The biomedical 
agency had drawn the gun lobby’s ire by funding a study that concluded 
that guns did not protect their owners from being shot in an assault 
(C. C. Branas et al. Am. J. Public Health 99, 2034-2040; 2009). Admit- 
tedly, the study was criticized by some for methodological weaknesses. 
But NRA chief Wayne LaPierre used it to dismiss all NIH-funded, 
gun-relevant research as “public health junk science” in a 2010 online 
commentary. Soon, Congress extended the ban to encompass the NIH. 

In the law that Obama signs this week, the NRA-backed prohibition 
will be continued. Strictly read, it does nothing to bar research, which 
is distinct from advocacy and promotion, Obama said in January, as 
he announced his response to the Newtown killings. 

That response included, commendably, a memorandum that 
directed the CDC and other agencies to launch a vigorous effort to 
study the causes of gun violence and ways to prevent it. Obama asked 
Congress to provide US$10 million in new money to the CDC to 
support its investigation of the most pressing questions, adding that 
White House lawyers do not interpret such research as contravening 
the existing prohibition. His administration argued in an accompany- 
ing document: “Research on gun violence is not advocacy; it is critical 
public health research that gives all Americans information they need” 

And they do need it. The 17-year dearth of government-funded 
research, and a parallel chill on collection of gun-related data by US 
agencies, have left politicians without the most rudimentary infor- 
mation needed to inform policy-making. Consider, for instance, that 
the most recent Department of Justice data on the proportion of gun 
sales occurring between private individuals — as opposed to sales by 


U: president Barack Obama was expected this week to sign the 


licensed dealers — date to 1994. Not since 2004 has the CDC included 
firearms-related questions in its Behavioral Risk Factor Surveillance 
System questionnaire, an instrument used to shape state policies. The 
number of academic publications relevant to firearm violence has fallen 
by 60% since peaking in 1996, according to an analysis by the coalition 
Mayors Against Illegal Guns, which looked at trends from 1960 to 2010. 

But however strong the need, Obama's 


“Obama’s research agenda will not get far in the presence 
researchagenda ofthe existing prohibition, which was crafted 
willnot get far to intimidate researchers and funding agencies 
in the presence —and has been having precisely that effect. 

of the existing The irony is that the gun lobby and its 


congressional allies might benefit from rig- 
orous research. Would a robust study reveal 
that state laws allowing citizens to carry concealed weapons have 
resulted in more or fewer deaths? We don't know. Would the spiking 
homicide rate in Chicago, Illinois, be higher still if it were not for the 
city’s restrictive gun laws, or are those laws ineffective? We don’t know. 
Does a limit on assault weapons reduce the overall rate of firearms 
injuries and deaths? We don't know. 

How many Newtowns will it take until Congress funds the research 
that will answer these and other pressing questions? Only strong evi- 
dence-based policies would allow politicians to tell grieving parents 
after the next slaying: we are doing everything we can to stop this 
happening again. = 


prohibition.” 


@ @ e@ @ 
Disciplinary action 
How scientists share and reuse information is 
driven by technology but shaped by discipline. 


around a lot in scientific publishing — and this week Nature 

presents a special series of articles that explores the industry's 
changing landscape (see page 425). But beyond the early adopters of 
digital technologies who shout freedom from the rooftops and the 
publishers who look on nervously, what do researchers make of it? 
Would it surprise you to learn, for example, that more than one-third 
of academic chemists disagree with the statement “all papers should 
be published open access”? 

The transformation of research publishing is less a revolution and 
more a war ofattrition. Battle lines were drawn long ago and all sides are 
well dug-in. In 2001, this journal published a series of viewpoints on the 
future of ‘e-access to the primary literature’ (see go.nature.com/pezj84). 


Te words ‘technology’ and ‘revolution’ are being bandied 
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Those attitudes seem strikingly familiar today. 
At the time, the founders of the Public Library 4h 
of Science initiative (then PLS, now reborn as 
the publisher PLoS) urged that research results 


THE FUTURE OF PUBLISHING 


A Nature special issue. 
nature.com/scipublishing 


information should be shared and reused. In the 
NPG survey, 45% felt that all papers should be 
published open access, but only 22% wanted to 
allow articles to be reused for commercial pur- 


should be stored in free, online, centralized 
repositories. Technology enthusiasts sang the praises of easy search 
and retrieval across a wide range of publication formats beyond the 
traditional journal article, but warned of the need for common stand- 
ards. Publishers pointed out that someone would have to finance the 
publication of the increasing tide of information, and debated where 
revenue sources should come from. 

There was a voice missing from that debate: yours. More than a dec- 
ade on, this journal's publisher, Nature Publishing Group (NPG), tried to 
remedy that by surveying more than 23,000 scientists about their experi- 
ence of and opinions on open-access publishing. The key question is not 
just what scientists could have, but what they want. The survey results 
— which NPG plans to release soon — suggest that many scientists are 
still thinking through their views on the open sharing that the Internet 
enables, and on whether they want to publish their research openly. 

One preliminary result that stands out is the diversity of experiences 
and attitudes across disciplines. In biology, 17% of papers published by 
the respondents over the past three years had been immediately made 
free for all to read by paying the publisher up front, and more than half 
of the biologists surveyed said that they had published at least one such 
paper. In chemistry, the proportion of papers was just 4%, and less than 
one-quarter of chemists had published at least one open-access paper. 
More than half of biologists felt that “all papers should be published 
open-access’, whereas just under one-third of chemists agreed (the 
remaining one-third of chemists neither agreed nor disagreed). 

Nor do scientists hold consistent views about how widely 


poses. A differently worded survey by publish- 
ers Taylor & Francis of some 14,500 scholars (split between sciences and 
the arts and humanities) found a similar inconsistency: 40% strongly 
agreed or agreed that their work should be “reused in any way’, but only 
18% said that it was acceptable for others to use their work for commer- 
cial gain (W. Frass, J. Cross and V. Gardner Open Access Survey Taylor & 
Francis; 2013). The figures perhaps represent a lack of understanding 
rather than deeply considered views. For example, when NPG asked 
scientists which open-access licence they had chosen, including share- 
alike, no derivatives and CC-BY, 85% of people said: “I don't know”. 

New technologies allow a much greater and faster transition to a 
digital future, and this week’s special issue reveals that scientists are 
finding a multitude of ways to publish and access their research results. 
As this journal has noted before, the future of research literature will 
ideally be an amalgam of papers, data and software that interlinks with 
tools for analysis, annotation, visualization and citation. The need for 
common standards is as great as ever. 

But it is demand, not supply, that will shape how scientists and 
publishers grasp these opportunities. For instance, a key reason that 
online open-access journals are now accepted as a mainstream (if still 
minority) method of publishing research is because of the mandates 
steadily introduced since 2001 by institutions and by research funders. 

The dazzling variety of publishing options will fragment the infor- 
mation available on the web. Scholars need to think through how they 
would like that information to be shared and reused — answers may be 
different for the various disciplines. One revolution does not yet fit all. = 


Push the boat out 


The latest private research vessel to be launched 
could open up the world of marine science. 


who forgets his troubles when he reads about an alternative world 

called Fantasia. So it is tempting to ask whether a new private 
research vessel, named after one of the characters from the film, can 
offer hard-pressed oceanographers a similar escape. 

As we report on page 420, the R/V Falkor has launched into troubled 
waters for marine science. Ship time is hard to come by, and funds for 
academic ocean research are shrinking, at least in the United States. 
More than ever, researchers are needing to cobble together money from 
several sources to pay for voyages that, in the past, might have been 
funded through one overarching grant. 

Arriving at a rate of knots on this scene are private foundations. 
The Gordon and Betty Moore Foundation in Palo Alto, California, for 
instance, has a marine-microbiology initiative that explores sites ranging 
from coastal upwellings to deep-sea sediments. But it does not accept 
unsolicited proposals from scientists. 

The non-profit Schmidt Ocean Institute, which operates the Falkor 
from its base in Palo Alto, does. It is open to applications to use the ship 
— for free — for studies that highlight the challenges that the world’s 
oceans face. The money comes from institute founder Eric Schmidt, 
former chief executive of Google. 

Many researchers are sceptical about the genuine scientific value of 
millionaire marine philanthropy. It is fashionable to send well-equipped 
expeditions to little-known parts of the sea, but such efforts seldom 
connect with the wider scientific community — although film direc- 
tor James Cameron said this week that he would donate his DEEPSEA 


Ts 1984 film The NeverEnding Story opens with a bullied boy 
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CHALLENGER submersible to the Woods Hole Oceanographic Institu- 
tion (WHOJD) in Massachusetts. The Falkor, however, is meant to bea 
long-term resource for use by anyone who can get his or her proposal 
accepted by a peer-review panel. 

The ship deliberately has no home port, so as to keep away from 
territorial claims to one ocean basin or for one country. US and 
Canadian scientists are scheduled to lead its first few cruises, but that 
is mainly an artefact of how the Schmidt institute started up. Its pro- 
gramme managers personally visited leading oceanographers in those 
countries to introduce the Falkor and to encourage them to apply for 
ship time. Non-Americans seem to be taking note; in the latest round 
of proposals, scientists from 23 countries applied. 

The Falkor signals the future of oceanography in other ways. It is, at 
heart, a Google ship, and as such promotes projects that integrate the 
latest technology into ocean exploration. Some academic outfits, such 
as the WHOL already push this limit, but the Google crew has some 
ideas of its own and scientists will follow them eagerly. The Schmidt 
institute also prods its shipboard scientists to release their data openly 
and rapidly, following the strong precedent set by the Moore foundation. 

A small ship such as the Falkor will not solve all the woes of ocean- 
ographers. One possible stumbling block is that the Schmidt institute 
awards only ship time; scientists must come up with their own funding 
to pay salaries and for post-cruise science. Thus, only researchers who 
are accomplished enough to secure funding from other sources will be 
able to spend time aboard. 

The Falkor has much to offer, but it will not and should not replace 
the current US research fleet, which is due to receive much-needed 
upgrades over the next ten years that will allow it to push the boundaries 
of marine science further than ever before. 

The NeverEnding Story was a joint project of 
film-makers in Germany and the United States. 
The Falkor was originally a German fishery-pro- 
tection vessel. With fair seas and a following wind, 
its story has some way to run yet. m 
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lifted a moratorium on work to make the H5N1 avian flu virus as 
transmissible between humans as seasonal flu, researchers are at 
it again. Earlier this month, a Dutch scientist proposed similar experi- 
ments with other avian flu viruses, as well as the SARS coronavirus. And 
a fortnight ago, scientists in Germany and Switzerland reported how 
they had tweaked canine distemper virus to make it grow in human cells. 

The logic behind these kinds of experiments, collectively called 
gain-of-function (GOF) research, is to identify combinations of muta- 
tions that could allow an animal virus to jump to unprepared humans. 
By knowing the mutations, the thinking goes, we can better prepare 
and marshal our scientific defences against a possible threat. 

GOF research on avian flu provoked heated controversy, much of it 
covered by this journal. That controversy did not 
go away with the lifting of the moratorium. On the 
contrary, it continues to fester. Officials in Wash- 
ington DC are putting the finishing touches to new 
guidelines for the review, regulation and oversight 
of this kind of research. The chill winds that we can 
anticipate blowing from policy-makers as a result 
could affect all of us who research viruses and their 
pathology. To avoid this, researchers in this field 
need to learn lessons from the past. 

Rather than use the avian flu moratorium to 
seek advice, listen and foster debate, many influ- 
enza scientists engaged in an academic exercise 
of self-justification. There was a single large open 
meeting, at the Royal Society in London, which 
engaged a wider audience, including bioethicists. 
The recent calling off of the moratorium by 40 flu 
researchers alone — not funders, governments or 
international bodies — says it all. The flu community simply hasn't 
understood that this is a hot-button issue that will not go away. 

There are parallels here to my own field of HIV. In the early days 
of research, HIV scientists, buoyed by huge research monies, exuded 
hubris, promised a vaccine within two years, and all sorts of other 
things. The crunch came when they realized that they had to engage 
seriously with patient groups. The result is that HIV patients became 
the most faithful collaborators of HIV clinicians. It is too easy for sci- 
entists in a field to dismiss criticism and ideas from outside. 

Here are the issues that must be openly addressed about GOF work 
with avian flu, the SARS coronavirus — or any other virus. 

First, is the virological basis for the work sound? The outcomes of 
the H5N1 experiments are dominated by the artificial-selection sys- 
tems used. If aerosol-transmitted virus is sys- 


Bi= two months after a small group of influenza virologists 
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GOING DOWN A 


BLIND ALLEY 
AND THE POWERS 
THAT BE ARE BLINDLY 
LETTING THEM GO 
DOWN THAT ALLEY. 


HSN] viral-engineering 
dangers will not go away 


Governments, funders and regulatory authorities must urgently address the 
risks posed by gain-of-function research, says Simon Wain-Hobson. 


are chosen, a transmissible virus of low virulence would ultimately 
emerge. Whether nature will take any of these courses is unknown. 
Take dog breeding. Ruthless selection of alleles over a short period has 
produced phenomenal phenotypic variation — dachshunds, salukis, 
whippets and setters. Would nature have come up with the dachshund? 

Second, infectious-disease researchers are fond of saying that 
microbes do not respect barriers. So who makes the rules and provides 
oversight? Barely a sound has emerged at the international level. The 
World Health Organization has held essentially closed-door meetings 
and has failed singularly to widen the debate. 

Third, what if these groups generate a highly pathogenic and trans- 
missible virus — which I suspect, within two years, they will? Then 
what? Should the virus be shared? Should research on this novel virus 
strain of catastrophic potential be highly restricted? 

Fourth, what if there were a leak or a small 
outbreak? Crippling lawsuits would follow. Are 
the academic institutions sufficiently covered 
in terms of insurance? Are university regents or 
chancellors even aware of the power, and dangers, 
of the modern molecular biology going on in their 
labs? Again, not a word has emerged. 

Fifth, the world has never been more densely 
populated. Is it appropriate for civilian scientists 
to make microbes more dangerous? Is creating a 
novel human virus antisocial? Was there a failure 
of duty on the part of funders and regulators? What 
is the ethical position on such work? Here there has 
been a start, but as yet there is no consensus. 

The global ramifications of GOF research 
have simply not been sufficiently explored and 
discussed. Influenza virologists are going down 
a blind alley and the powers that be are blindly letting them go down 
that alley, which is tantamount to acquiescing. So let’s be clear: the end 
game could be viruses more dangerous than the Spanish flu strain. 

H5N1 GOF work — indeed all virological GOF work — should be 
suspended until virologists open up and engage in public discussion 
of their work and the issues it raises. Given that the flu community 
failed utterly to use the year-long hiatus to good effect, it is clear that 
an independent risk—benefit assessment of GOF work is needed. 

Governments, funders and regulatory authorities should encour- 
age the influenza virologists to listen and discuss. Learned societies 
should get off the fence and speak out. A conference involving all 
the stakeholders is needed, as happened at Asilomar in the 1970s for 
recombinant DNA. The problem will not go away. It has to be engaged 
and it has to be done now. m 


Simon Wain-Hobson is chair of the Foundation for Vaccine Research, 
which is based in Washington DC. 
e-mail: simon. wain-hobson@pasteur.fr 
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RESEARCH HIGHLIGHTS 


Male mice cut out 
for fathering 


Male mice that live with their 
mates help to nurture pups, 
whereas virgin males attack 
them, owing to differences 
in neural signalling. Kashiko 
Tachikawa and her colleagues 
at the RIKEN Brain Science 
Institute in Saitama, Japan, 
compared how sexually naive 
males and cohabiting fathers 
reacted to pups displaced 
from their nests, and found 
that their behaviour was 
mediated by the pheromone- 
detecting vomeronasal 

organ located at the base of 
the nasal cavity. Most of the 
virgin males attacked the 
pups, and the neural pathways 
that link the vomeronasal 
organ to other areas in their 
brains were highly activated 
in these animals. Cohabiting 
fathers, however, carried 
pups back to their nests and 
displayed little activation 

in these neural pathways. 
Sexually naive males showed 
parental behaviour if their 
vomeronasal organ had been 
removed. 

J. Neurosci. 33, 5120-5126 
(2013) 


Trilobite fossil 
spotted 


Speckles on 
390-million- 
year-old 
trilobite fossils 
(pictured) may 
have helped 
to camouflage 
the marine 
invertebrates. 
A team led 
by Christopher 
McRoberts, of the State 
University of New York at 
Cortland and Thomas Hegna 
at Western Illinois University 


Selections from the 
scientific literature 


Primates hunting leaves forest scar 


By hunting for primates, humans may create an 
ecological ripple that makes the hunted regions 
permanently unsuitable for the animals. 

Ola Olsson at Lund University in Sweden 
and his colleagues looked at six sites in three 
protected areas in south-eastern Nigeria. 

Three of the sites were well protected from 
hunters, and had more than three times as 
many primate groups and more than twice as 


many fruit-tree seedlings as the hunted sites. 
The researchers suggest that with many fewer 
primates eating fruits and dispersing the seeds 
by spitting and defaecation, fruit trees do not 
reproduce as well in hunted areas. As these 
hunted forests age, the lack of seedlings will 
mean the dying trees are not replaced, and 
primates will be unlikely to find enough to eat. 
Proc. R. Soc. B 280, 20130246 (2013) 


in Macomb examined 
more than 25 Eldredgeops 
rana trilobites that were 
discovered in central and 
western New York state. The 
fossil exoskeletons were 
dotted with regular 
arrangements 
A ofa variable 
= number 

of spots 

— more 

than 500 in 
some cases. 
Chemical and 
mineral analyses indicated 
that spots are of similar 
composition to the rest of 
the fossil, suggesting the 
markings are not artefacts. 

No pigment molecules were 
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preserved, but the authors 
speculate that the spots may 
have scattered sunlight and 
blurred a trilobite’s shadow. 
Geology http://dx.doi. 
org/10.1130/G34158.1 (2013) 


Serotonin 
signalling secrets 


Researchers have deciphered 
molecular structures of two 
neurotransmitter receptors 
bound with drug molecules. 
Bryan Roth at the 

University of North Carolina 
Chapel Hill Medical School 
and Raymond Stevens at the 
Scripps Research Institute in 
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La Jolla, California, and their 
colleagues, solved the crystal 
structure of a serotonin 
receptor called 5-HT,, 
bound to the migraine 
medication ergotamine, 
which is chemically related 
to the hallucinogen lysergic 
acid diethylamide (LSD). 

Ina related paper, a 
team led by Stevens and 
Eric Xu at the Van Andel 
Research Institute in Grand 
Rapids, Michigan, solved 
the structure of another 
serotonin receptor, 5-HT\,, 
bound to ergotamine. 
Differences between the 
structures show how similar 
drugs can have different 
biological effects. 


TIM LAMAN/NATUREPL.COM 


CHRISTOPHER MCROBERTS 


IDA/DLR/MPS/UCLA/JPL-CALTECH/NASA 
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Such insight could help 
to design receptor-specific 
therapeutics and limit side 
effects, the authors say. 
Science http://dx.doi.org/ 
10.1126/science.1232807 
(2013); Science http://dx.doi. 
org/10.1126/science.1232808 
(2013) 

For a longer story on this research, 
see go.nature.com/yw8pdm 


Single cells seen 
in whole brains 


A fast imaging technique 
can monitor the activity of 
individual neurons across 
the brain of fish larvae, 
providing both a detailed 
and comprehensive view of a 
vertebrate brain. 

Misha Ahrens and 
Philipp Keller at Janelia 
Farm Research Campus in 
Ashburn, Virginia, imaged 
the brains of transparent 
larval zebrafish (Danio 
rerio), which had been 
engineered to have neurons 
that fluoresce when active. 
The authors improved ona 
technique that uses planes 
of laser light to capture 
images layer by layer, called 
light-sheet microscopy, 
by increasing the imaging 
speed. The method could 
cover the full brain of a 
5-day-old zebrafish larva 
every 1.3 seconds, detecting 
more than 80% of its 
100,000-odd neurons at the 
single-cell level (pictured). 

Such images can help 
researchers to map activity 


across the brain and show 
which parts of the brain 
function together, the 
researchers say. 

Nature Meth. http://dx.doi. 
org/10.1038/nmeth.2434 (2013) 
For a longer story on this research, 
see go.nature.com/f8dtkd 


Fat cells use 
immune signal 


In obese individuals, fat cells 
may act like inflammatory 
white blood cells by using 
communication machinery 
once considered to be 
exclusive to immune cells. 

High numbers of several 
types of immune cell patrol the 
fat tissues of people who are 
obese. This inflammation is 
highly correlated with health 
risks such as insulin resistance 
and metabolic syndrome, 
although how it gets started is 
unclear. Willa Hsueh and her 
colleagues at the Methodist 
Hospital in Houston, Texas, 
show that it may begin with 
adipocytes, or fat cells. The 
authors compared gene 
expression in lean and obese 
women and noticed that 
adipocytes expressed many 
genes connected to the major 
histocompatibility complex 
I, which certain white blood 
cells use to activate other 
white blood cells. An array of 
follow-up studies in mice fed 
high-fat diets indicate that it 
is the fat cells themselves that 
prompt inflammation. 

The researchers suggest that 
obesity triggers alterations in 
fat cells that are then escalated 
by white blood cells. 

Cell Metab. 17, 411-422 (2013) 


DEVELOPMENT 


Origins of the 
ear lining 


In mammalian embryos, the 
middle-ear chamber starts as 
a pouch that pokes out from 
the oral cavity. The roof of this 
cavity ruptures and fills with 
cells that then retract up over 
the bones of the middle ear, 
leaving an air-filled space. 
Hannah Thompson and 


RESEARCH HIGHLIGHTS BiiiSaiiaa¢ 


Long DNA-like chains assemble 


> HIGHLY READ 


Extremely long molecular chains can 
self-assemble from two nucleobase- 


like chemicals, report Nicholas Hud at 
the Georgia Institute of Technology in 


Atlanta, and his co-workers. 


The team showed that three of each of the monomers 
cyanuric acid and a pyrimidine derivative can forma 
rosette structure. When combined in water, the rosettes 
stacked themselves into long chains that were more than 
1 micrometre in length, the longest self-assembling water- 
soluble structures ever made from such small molecules. 
Hydrogen bonding and stacking between the monomers 
was similar to that seen between individual nucleobases 
in DNA and RNA. The work provides insight into how a 
self-assembling system might have existed in a prebiotic 


world. 


J, Am. Chem. Soc. 135, 2447-2450 (2013) 


Abigail Tucker at King’s 
College London looked at 
mice genetically engineered 
such that the descendants 

of cells in the developing 

ear could be tracked. This 
revealed a double origin for 
the epithelial cells within the 
cavity, and thus in two cell 
types. Cells lining the bottom 
of the cavity are thick and have 
cilia that robustly sweep away 
debris. Cells lining the roof are 
flat and offer little protection 
from debris or inflammation. 
The way in which the 
mammalian ear develops and 
our susceptibility to middle- 
ear infections may be related, 
the authors suggest. 

Science 339, 1453-1456 (2013) 


Migrating planets 
sped up collisions 


The pockmarked asteroid 
Vesta (pictured) has provided 
more hints about the history 
of the early Solar System. 

As giant planets such as 
Jupiter and Saturn shifted 
into place about 4.1 billion 
years ago, their changing 
gravitational pull flung 
asteroid fragments into highly 
eccentric orbits. Those space 


rocks, in turn, slammed into 
other asteroids at high speed, 
heating them and altering 
their chemistry. 

This scenario, proposed 
by Simone Marchi at the 
Southwest Research Institute 
in Boulder, Colorado, and 
his colleagues, could explain 
why the asteroid Vesta was 
pummelled so frequently 
between 4.1 billion and 3.4 
billion years ago. The team’s 
simulations suggest that 
meteors were slung at much 
higher velocities than once 
thought, for a period that 
lasted hundreds of millions 
of years. 

Nature Geosci. http://dx.doi. 
org/10.1038/ngeo1769 (2013) 
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SEVEN DAYS nse: 


US budget boost 


Some US science agencies 

will receive a modest funding 
boost under government 
spending plans worth 

US$984 billion in total for 
2013, as approved by Congress 
on 21 March. The budgets 
announced for the National 
Science Foundation, Food and 
Drug Administration and the 
National Institutes of Health 
provide follow-on finances 
from current spending plans 
that expire on 27 March. 

But the budget increases 

do not negate across-the- 
board funding cuts of 5.1% 
that began on 1 March. 

See go.nature.com/pvoord 

for more. 


French protest 
French scientists went on 
strike on 21 March to protest 
against a bill to reform 
research and higher education. 
“French universities are 

being deliberately driven 

to bankruptcy by laws,’ 
campaigners said. Research 
minister Genevieve Fioraso 
acknowledged that budget 
deficits were limiting 
government spending on 
science. The bill was adopted 
by the French cabinet on 

20 March and will be presented 
to parliament on 27 May. 

It should become law by the 
end of July. See go.nature.com/ 
xz7sl7 for more. 


The amount (US$6.7 million) 
anonymously donated on 

19 March to Britain’s cash- 
strapped Royal Institution to 
save it from having to sell its 
214-year-old London home 
to pay off debts. 


Apollo engines rise from the deep 


Parts of the Saturn V rocket engines that 
powered NASA’ Apollo astronauts to the 

Moon have been recovered from the floor of 
the Atlantic Ocean. The recovery team — led 
by Jeff Bezos, founder of the online company 
Amazon.com — announced on 20 March that it 
had hauled up enough pieces to rebuild two F-1 


Stem-cell grants 

The California Institute 

for Regenerative Medicine 
(CIRM) in San Francisco has 
awarded US$32 million in 
grants for research on human 
induced pluripotent stem (iPS) 
cell lines. Winners include 
seven California-based teams 
that will derive iPS cells from 
specific tissues to model 
disease in individual patients. 
The United States lacks a 
central iPS-cell bank like 

those proposed in Japan, but 
the California initiative will 
give US researchers access to 
well-characterized iPS lines for 
research, says the CIRM. 


Canadian budget 
Canada’s 2013 budget, 
published on 21 March, 
has been welcomed by 
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applied researchers but 
criticized by basic scientists. 
It includes Can$383 million 
(US$383 million) for research 
infrastructures, university 
research partnerships with 
industry and support for 


growing innovative businesses. 


But James Turk, executive 
director of the Canadian 
Association of University 
Teachers in Ottawa, notes that 
“there is a consistent pattern 
of steering money away from 
basic research” See go.nature. 
com/5plxbz for more. 


DNA transplants 
The United Kingdom has 
edged closer to being the 
first nation to legalize in vitro 
fertilization techniques 
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engines, which lifted the rockets off the launch 
pad and then fell into the sea (pictured). NASA 
launched 13 Saturn Vs, each of which had five 
F-1 engines, between 1967 and 1973. “We found 
so much,’ says Bezos, “an incredible sculpture 
garden of twisted F-1 engines that tells the story 
ofa fiery and violent end” 


that prevent children from 
inheriting diseases caused by 
faulty maternal mitochondrial 
DNA. Legislators in London 
agreed on 20 March to 
provide the government 

with regulatory advice on 
technologies that transplant 
nuclear DNA from an egg 
with diseased mitochondria 
into a healthy donor cell. A 
consultation found that there 
was broad public support for 
the techniques if they are used 
to prevent serious diseases. 
The UK government will now 
decide whether to legalize the 
technology. See go.nature. 
com/gxvwkv for more. 


Misconduct rebuttal 
The US Department of the 
Interior (DOI) has rejected 
allegations brought by 


BEZOS EXPEDITIONS 


Paul Houser, a former DOI 
scientific-integrity officer 
who claimed in 2012 that 

° he was unfairly dismissed 

for identifying misconduct. 
The DOT rejected the claims 
in its first report on alleged 
misconduct since introducing 
a scientific-integrity policy 

in 2011. The report was 

sent on 19 March to Public 
Employees for Environmental 
Responsibility, a watchdog 
group in Washington DC. 

See go.nature.com/1jcafh 

for more. 


AWARDS 


Physics prize 

The 2013 Fundamental 
Physics Prize was awarded 

to Alexander Polyakov of 
Princeton University in New 
Jersey for his work on the 
quantum mathematics that 
underpin the standard model 
of particle physics. The prize, 
announced on 20 March and 
worth US$3 million, is an 
annual award established by 
Russian Internet entrepreneur 
Yuri Milner in 2012 to honour 
recent advances in physics. 
See go.nature.com/a7u4hr 

for more. 


LIFF MOORE 


Abel prize winner 
Belgian mathematician 

Pierre Deligne (pictured) 

has won the Abel Prize, 

one of the most prestigious 
awards in mathematics, worth 
6 million Norwegian kroner 


(US$1 million). Deligne, 

who works at the Institute for 
Advanced Study in Princeton, 
New Jersey, was awarded the 
prize on 20 March for his 
research over the past 50 years 
connecting algebra and 
geometry, and for advancing 
number and representation 
theory. The prize is awarded 
annually by the Norwegian 
Academy of Science and 
Letters in Oslo and is 

named after the Norwegian 
mathematician Niels Henrik 
Abel. See go.nature.com/ 
psnyu2 for more. 


| RESEARCH 
Neanderthal genome 


The sequence of the entire 
Neanderthal genome has been 
completed from a toe bone 
found in a cave in central 
Siberia. Researchers at the 
Max Planck Institute for 
Evolutionary Anthropology in 


Leipzig, Germany, announced 
on 19 March that they had 
generated a genome sequence 
50 times the quality of a draft 
Neanderthal genome released 
in 2010 (R. E. Green et al. 
Science 328, 710-722; 2010). 
The higher quality genome, 
which is available online (see 
go.nature.com/o2kd5d), 
could help researchers to chart 
the evolutionary histories of 
Neanderthals, their Denisovan 
relatives and modern humans 
more accurately. 


Big Bang pictures 
The Planck space telescope has 
produced the most detailed 
images yet of the residual glow 
from the Big Bang. Scientists 
from the European Space 
Agency unveiled the results, 
which give clues to the first 
moments of the Universe, on 
21 March. See page 417 and 
go.nature.com/utdcjm for 
more. 


Solar bankruptcy 
A Chinese subsidiary of the 
world’s largest manufacturer 
of solar photovoltaic panels 
announced bankruptcy on 
20 March after defaulting 

on US$541 million in loans. 
Suntech Power Holdings was 
founded in Wuxi, China, in 
2001, and led the solar market 
until fierce competition from 
2008 onwards depressed 
prices. Eight Chinese banks 


JAPAN’S FOSSIL-FUEL RELIANCE 


SOURCE: US EIA/BLOOMBERG 


TREND WATCH 


The amount of electricity 
generated from fossil fuels rose 
to 90% of Japan’s total electricity 
output during 2012, according 
to the US Energy Information 
Administration (see chart). 
Only two nuclear reactors are 
operating two years after the 
March 2011 earthquake and 
tsunami that led to three reactor 
meltdowns at the Fukushima 
Daiichi nuclear power plant. 
Japan has imported liquid 
natural gas and has increased its 
consumption of crude oil and 
heavy fuel oil to meet demand. 


Nuclear-plant outages led to a 21% rise in the use of fossil fuels for 
electricity generation in Japan in 2012. 


Xe} 
Oo 


[Fossil fuels J Nuclear [§§) Hydroelectric 


80 


| 
Oo 


fen) 
fo} 


oa 
fo} 


™~ 
jo} 


30. 


DS) 
io} 


~ 
n 
(g 
3 
fo} 
£ 
ry 
= 
6 
RY 
o 
& 
n 
v 
# 
ip 
3 
> 
a 
= 
2 
a 
& 
© 
o 
fa 
& 
oO 


KR 
[o) 


jo) 


2007 2008 2009 2010 2011 2012 


SEVEN DAYS | THIS WEEK | 


1 APRIL 

UK funding agencies 
introduce policies that 
require research to be 
published as open access. 
go.nature.com/7Ivzll 


2-5 APRIL 

In Seefeld, Austria, 
researchers meet to 
discuss how ecosystems 
respond to climate 
variability and extreme 
weather. 
go.nature.com/m7jb9n 


asked a court to declare the 
subsidiary, Wuxi Suntech, as 
insolvent. The subsidiary did 
not object to the petition. 


PEOPLE 


Basic science chief 
The National Institute of 
General Medical Sciences at 
the US National Institutes of 
Health in Bethesda, Maryland, 
appointed a new director 

on 25 March. Jon Lorsch, a 
biochemist at Johns Hopkins 
University in Baltimore, 
Maryland, will head the 
US$2.4-billion-a-year basic- 
research institute, which 

has been overseen by acting 
director Judith Greenberg 
since 2011. 


Australian tensions 
Australia has its fourth 

new cabinet minister for 
science and research in 

less than 16 months, after 
prime minister Julia Gillard 
appointed Craig Emerson to 
the post on 25 March. The 
worrying instability is the 
result of leadership tensions 
since Gillard deposed former 
prime minister Kevin Rudd 
in 2010. On 22 March, former 
science minister Chris Bowen 
(a Rudd supporter) resigned 
after a failed attempt to unseat 
Gillard. See go.nature.com/ 
jde9dw for more. 
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ITALY Minister gives green 
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stem-cell treatments p.418 
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scam p.421 massive collider p.422 ‘ 


COSMOLOGY 


SPECIALISSUE Nature explores 
. the new landscape of 
publishing p.425 


Planck has unveiled the 
sharpest picture of light 
from the 380,000-year- 
old Universe. 


Planck snaps infant Universe 


Space telescope culls exotic creation theories with ultra-precise microwave map. 


BY MARK PEPLOW 


or astronomers, it is the ultimate treasure 
map. On 21 March, the Planck space tele- 
scope team released the highest-precision 
map yet of the cosmic microwave background 
(CMB), the faint but ubiquitous afterglow of 
the Big Bang. Crowning nearly 50 years of CMB 
study, the map records the precise contours of 
the nascent Universe — and in doing so pins 
down key parameters of the Universe today. 
The tiny fluctuations embedded in the 
CMB map reveal a Universe that is expanding 
slightly more slowly than had been thought. 
That dials back the amount of gravity-counter- 
ing ‘dark energy’ to 68.3% of the Universe, and 
adds a little more unseen dark matter to the 
mix. It also means that the Universe is a little 
older: 13.82 billion years old, adding a few tens 
of millions of years to the previously calculated 
value. The map even shows that the number 
of neutrino ‘flavours’ permeating the cosmos 
will probably remain at three — had there been 


a fourth, the Universe would have expanded 
more quickly during its first moments. 

These results represent refinements of num- 
bers obtained by previous missions such as 
the Wilkinson Microwave Anisotropy Probe 
(WMAP). Where the Planck spacecraft, watch- 
ing the sky from a vantage point 1.5 million 
kilometres away, breaks the most new ground 
is in its support for the reigning theory that 
describes the instant after the Big Bang. The 
theory, known as inflation, holds that during 
an unimaginably rapid expansion lasting just 
10” seconds or so, the Universe grew from 
a subatomic point to something the size of a 
grapefruit that then continued to expand at a 
more stately pace. This growth spurt would 
help to explain why the Universe we see today 

is homogeneous on the 


> NATURE.COM largest scales, yet riddled 
For more with clumps, filaments 
on the Planck and sheets of galaxies. 

mission: “Planck could have 
go.nature.com/ylwzmu © found that there was 
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something majorly wrong with inflation,” says 
astrophysicist Jo Dunkley at the University of 
Oxford, UK, who has worked on data from 
Planck and the WMAP. “Instead, we've got new 
evidence that this expansion did happen.” 

In the minutes that followed the burst 
of inflation, particles such as protons and 
electrons formed from the cauldron of proto- 
matter, and photons began to bounce around 
like pinballs. It was only 380,000 years later, 
when the charged plasma cooled into neutral 
atoms, that those photons could fly freely. 
Today they make up the CMB, and carry with 
them an imprint of the quantum fluctuations 
that roiled the inflationary Universe. 

Seen in the map as tiny variations around an 
average temperature of 2.7 kelvins, the fluctua- 
tions caused alterations in the density of matter, 
which ultimately snowballed into the galaxies 
seen today. “All the structures we see in the 
Universe are coming from these little perturba- 
tions,” says Paul Shellard, a Planck cosmologist 
at the University of Cambridge, UK. > 
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PLANCK COLLABORATION/ESA 


| NEWS IN FOCUS 


> Inflation, proposed in 1980 by Alan Guth, 
a physicist now at the Massachusetts Institute 
of Technology in Cambridge, predicts that the 
range of temperature variations should follow 
a bell curve — a smooth Gaussian distribu- 
tion. Cosmologists had found hints in previ- 
ous missions that the distribution was not so 
smooth, suggesting that some other process 
was involved in inflating the Universe (see 
Nature http://doi.org/bgjd86; 2008). But so far, 
Planck’s temperature data look almost perfectly 
Gaussian, and standard theories for inflation 
are looking ever more secure. 

“A lot of baroque inflationary models are 
gone,’ says Paul Steinhardt, a theoretical physi- 
cist at Princeton University in New Jersey, who 
has tried to poke holes in inflation by propos- 
ing theories such as ekpyrosis, which invokes 
a cyclical Universe that dies and is reborn ina 
series of Big Bounces. 

But the cosmological case is not quite 
closed. There are a few details in Planck’s map 
that seem out of place: an odd ‘cold patch; for 
example, and a greater proportion of hotspots 
on one side of the sky. Moreover, Planck’s value 
for the Hubble constant, which describes the 
rate of expansion of the Universe, is surpris- 
ingly low compared with estimates made with 
other astronomical techniques — perhaps a 
hint of new physics in play. 

Full confirmation of inflation — and clues 
about what drove it — will depend on the 
detailed properties of the CMB’s photons. The 
wrenching moment of inflation should have 
shaken the very fabric of space-time, resulting 
in gravitational waves. They in turn may have 
left a pattern in the polarization of the photons. 
The Planck team expects to release its polariza- 
tion data early next year. “If we found gravita- 
tional waves, wed get a Nobel prize — it’s a big 
deal,” says George Efstathiou, director of the 
Kavli Institute for Cosmology in Cambridge, 
UK, and one of Planck’s lead researchers. 

The exceedingly faint polarization signal 
may lie beyond the reach of Planck's detectors. 
Ground-based microwave telescopes, such as 
the Keck Array in Antarctica, are also in on 
the hunt, although they are limited to looking 
at one hemisphere of the sky, and in certain 
microwave frequencies, because oxygen in 
Earth’s atmosphere can block some of the CMB 
photons. Charles Lawrence of NASA’ Jet Pro- 
pulsion Laboratory in Pasadena, California, the 
lead Planck scientist in the United States, says 
that it may take another space telescope to fin- 
ish the job, or perhaps even a mission, decades 
away, to detect the gravitational waves directly. 

But in terms of temperature variations, 
Lawrence says, astronomers will have to be 
content with Planck, which “squeezes pretty 
much all the juice out of the CMB”. He finds 
that juice very sweet, even ifit leaves a few ques- 
tions beyond reach. “We have a pretty good 
idea of what the Universe is, but we don't have 
the faintest idea why it is,” says Lawrence, add- 
ing with impish glee: “It’s rather fun, isn’t it?” m 
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A naked woman joined protesters in Rome calling for stem-cell therapy for all incurably ill patients. 


REGENERATIVE MEDICINE 


Stem-cell ruling 
riles researchers 


Italian health minister’s support for a controversial 
treatment appals the country’s scientists. 


BY ALISON ABBOTT 


( ‘re that offer unproven stem-cell 
treatments often end up playing cat 
and mouse with health regulators, 

no matter which country they operate in. In 

Italy, however, one such treatment now has 

official sanction. The country’s health minis- 

ter, Renato Balduzzi, has decreed that a con- 
troversial stem-cell treatment can continue 
in 32 terminally ill patients, mostly children 

— even though the stem cells involved are 

not manufactured according to Italy’s legal 

safety standards. 

The unexpected decision on 21 March has 
horrified scientists, who consider the treat- 
ment to be dangerous because it has never 
been rigorously tested. In the opinion of 
stem-cell researcher Elena Cattaneo of the 
University of Milan: “It is alchemy”. 

The decision followed weeks of media pres- 
sure to authorize compassionate use of the 
therapy, which was developed by the Brescia- 
based Stamina Foundation and has been 
repeatedly banned in the past six years. Now, 
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patient groups are pushing for the treatment 
to be available to anyone with an incurable 
illness. Hundreds protested in Rome on 
23 March, including a naked woman with 
pro-Stamina slogans painted on her skin. 

Stamina Foundation president Davide 
Vannoni, a psychologist at the University 
of Udine, says that the publicity around the 
treatment has won him 9,000 new patients. 
He hopes that further modifications to the 
law will allow him to expand the therapy. 

A month ago, an investigatory television 
programme, The Hyena, reported that chil- 
dren with incurable diseases such as spinal 
muscular atrophy were being denied sup- 
posedly important treatment, and Italian 
show-business personalities joined the call 
to relax rules on stem-cell treatment. 

In Italy, the compassionate use of as-yet- 
unapproved therapies is allowed on an emer- 
gency basis for dying individuals who have 
no other options, and the national health 
service must provide them for free. The law 
requires that health authorities approve the 
quality of such therapies, but some of its terms 
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are ambiguous, says Amedeo Santosuosso, a 
Milanese judge and a professor at the Univer- 
sity of Pavia who specializes in science and law. 
“That has been the underlying problem in the 
Stamina debacle,” he says. “In the case of the 
Stamina Foundation therapy, there is no sug- 
gestion that it might be efficacious, so in my 
opinion compassionate use is not legitimate? 
Vannoni says that he developed the therapy 
after having successful stem-cell treatment for a 
virus-induced facial paralysis in 2004 in Russia. 
He invited a Russian and a Ukrainian scientist 
to Turin to develop the method and says that 
Stamina has since treated 80 or so patients — 
including people with Parkinson's disease, Alz- 
heimer’s and muscle-wasting disorders. He has 
not published the outcomes or precise details of 
his therapy, which uses the mesenchymal stem 
cells from bone marrow that differentiate into 
bone, fat and connective tissue. In his protocol, 
the cells are extracted from patients, manipu- 
lated in the laboratory and then re-infused. 
Vannoni acknowledges that he has not pub- 
lished outcomes but says that the method is far 
from alchemy. Each treatment uses five types 
of cell, he explains, with their claimed charac- 
teristics tuned to replace damaged tissue or to 
secrete molecules that could reduce inflamma- 
tion, fight infection or promote blood-vessel 
growth. ” Whatever the disease, one of the types 
of cell is going to have the right effect,’ he says. 
When a 2007 European Union regulation 
required that stem-cell therapies follow the same 
safety and efficacy rules as pharmaceuticals, 


Vannoni moved his lab to the republic of San 
Marino. “There, rules were not so strict;’ he says. 

But his work had drawn the attention of a 
Turin prosecutor, Raffaele Guariniello, whose 
investigations concluded that Vannoni’s oper- 
ation could be “dangerous to public health”. 
Vannoni says that Guariniello marshalled 
international pressure to stop him working in 
San Marino, so he moved to Trieste, where he 
says Guariniello again stopped his work. 

From there, Vannoni moved to a pub- 
lic hospital in Brescia. Last May, a delega- 
tion including representatives of the Italian 
Medicines Agency (AIFA) and the ISS, the 
health ministry’s national institute, visited the 
Brescia lab and reported chaotic conditions: 
ethics-committee approvals had been based 
on inadequate information, and there were 
no detailed protocols or patient follow-up, for 
example. The AIFA closed the lab, stating that 
the facilities could not be trusted to produce 
contamination-free preparations. 

Patients and families turned to the legal sys- 
tem to allow treatments to continue as compas- 
sionate use; many of the courts concluded that 
it was a patient's right to receive treatment and 
that health services must offer it, and in some 
cases the Brescia lab once again supplied cells. 

Some of the compelled treatments led to the 
only publication of clinical results so far. Clini- 
cians at the Burlo Garafalo Children’s Hospital 
in Trieste treated five babies with type I spinal 
muscular atrophy and published the results 
last October (M. Carrozzi et al. Neuromuscul. 
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Disord. 22, 1032-1034; 2012). They found that 
“the treatment did not change the course of the 
disease’, says co-author Marco Carrozzi. Van- 
noni argues that the therapy failed because the 
clinicians did not use his exact cocktail of cells. 
Setting himself against his own regulatory 
agencies, Balduzzi had earlier angered scien- 
tists when, on 7 March, he authorized contin- 
ued therapy for a three-year-old child with the 
deadly disease metachromatic leukodystrophy 
— provided that the stem cells were created in 
a good manufactur- 


“The treatment ing practice (GMP) 
did not change facility. Thirteen aca- 
the course ofthe demics, including 


Cattaneo and San- 
tosuosso, published 
an open letter to Balduzzi warning him of the 
dangers (see go.nature.com/pb1 wd]; in Italian). 

That authorization was bad enough, says 
Paolo Bianco, a stem-cell scientist at the Uni- 
versity of Rome who co-signed the letter. “Now 
the minister is allowing the non-GMP version 
and saying that an unauthorized, unpublished, 
unknown practice is a ‘treatment” 

Balduzzi’s decree is likely to be his last legis- 
lative act in Italy’s outgoing government, and 
scientists hope that his successor will respect 
the role of the AIFA and other science-based 
agencies. AIFA president Luca Pani declined 
to comment on the political decision but says 
that his agency is sticking to its statements on 
the safety and efficacy of the stem-cell prepara- 
tions from Brescia. “Our ban holds,” he says. m 


disease.” 


Drug-company data 
vaults to be opened 


European agency will publish firms’ clinical-trial results. 


BY DANIEL CRESSEY 


industry to share the results of clinical trials 
have grown ever more intense amid revela- 
tions that high-profile companies have hidden 
crucial data on safety and efficacy. Now Europe 
is moving towards measures that would sig- 
nificantly increase disclosure of such data. 
The European Medicines Agency (EMA) 
in London, which licenses drugs for use in 
the European Union, is developing a policy to 
publish some clinical-trial data submitted by 
companies. And next month, major players in 
the UK medical community will meet to dis- 
cuss the practical problems of data openness. 
The meeting is likely to take place on 


[inssseres calls for the pharmaceutical 


19 April and could feature representatives 
from biomedical charity the Wellcome Trust, 
as well as the Academy of Medical Sciences, 
the Association of the British Pharmaceuti- 
cal Industry and the Association of Medical 
Research Charities. It marks a move from dis- 
cussion to action, says Nicola Perrin, head of 
policy at the Wellcome Trust in London. “We 
should all stop discussing whether [the issue 
is] important or not and start having practi- 
cal discussions about how we move forward,” 
she says. 

The United States already requires that 
clinical trials used to secure drug approvals are 
listed in a public online registry. Other coun- 
tries have rules that encourage registration. 
But some researchers and campaigners fear 


that key details are not getting into the public 
domain, making it difficult to assess a drug’s 
safety and depriving researchers of data. 
Critics note, for example, that in recent years 
the London-based drug company AstraZeneca 
has been mired in legal problems including 
accusations that it concealed data on the side 
effects of its antipsychotic drug Seroquel (que- 
tiapine). And GlaxoSmithKline (GSK), also 
based in London, has paid out billions of dol- 
lars after pleading guilty to charges including 
misrepresenting the safety of its diabetes drug 
Avandia (rosiglitazone) and the effectiveness 
of its antidepressant Paxil (paroxetine) in chil- 
dren. The Cochrane Collaboration, an interna- 
tional group of medical experts, has called for 
Roche in Basel, Switzerland, to open up its data 
on the influenza drug Tamiflu (oseltamivir) so 
that the group can assess the drug’s efficacy. 
Drug companies are making concessions. 
GSK and Roche have agreed to share data with 
scientists whose requests are deemed legitimate. 
But critics say that Roche has not established a 
fully independent group to assess requests. 
Last year, following pressure from cam- 
paigners, the EMA said that it would pro- 
actively publish certain data submitted by drug 
companies. The agency is consulting industry 
and academic researchers and funders on > 
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> how this would work; it is likely to have 
a policy in place by 1 January 2014. 

“Europe is now ahead of the United States 
in this area,” says Lisa Bero, a pharma- 
cologist at the University of California, San 
Francisco, who studies bias in scientific 
publication. “We're all waiting’, she adds, to 
see what will unfold. “This is all very new.’ 

Bero says that the move towards more 
transparency is a victory. But “there’s going 
to bea lot more battles” over how informa- 
tion should be released, how detailed it 
should be, who should control its release 
and who should have access to data that 
might reveal identities of trial participants. 

Campaigners suchas All Trials, an inter- 
national group launched in January to put 
pressure on governments and regulators, 
have demanded that public registration not 
be limited to trials used in drug-approval 
applications and should include full meth- 
ods and results of past and current trials. 

Many medical researchers would be satis- 
fied if summary results were made public, 
ideally as peer-reviewed papers. But some 
want full release of the detailed clinical 
study reports that drug companies submit to 
regulatory agencies. Researchers who con- 
duct meta-analyses by pooling data from 
multiple trials also want access to 
anonymized data on individual participants. 

In 2010, the European Ombudsman 
ruled that detailed data should not be seen 
as commercially confidential. That helped 
to drive development of the EMA’s policy, 
which will increase access to such informa- 
tion. But patient-level data are unlikely to 
be placed wholesale into the public domain, 
and companies, researchers, trial funders 
and patient groups disagree on who should 
decide which researchers can access them. 

Groups also wrestle over who will enforce 
the rules. The UK Health Research Author- 
ity, which oversees standards and regulations 
in medical research, is assessing the practi- 
calities of assigning the job to the same ethics 
comunittees that approve clinical trials. 

Kay Dickersin, director of the Center for 
Clinical Trials at Johns Hopkins Bloom- 
berg School of Public Health in Baltimore, 
Maryland, says that serious legal clout is the 
only way to enforce transparency. Fines are 
not enough, she adds: for serious infringe- 
ments, “someone has to go to jail”. m 


The R/V Falkor can deploy remotely operated submersibles. 


OCEANOGRAPHY 


Private research 
ship makes waves 


Falkor plots a fresh course for ocean studies with Google cash. 


BY ALEXANDRA WITZE 


from the spartan existence endured 

by most academic oceanographers on 
research trips. The privately run research 
vessel features a sauna, a glassed-in lounge and 
a helicopter pad to be kept clear at all times for 
VIP guests. 

All of which isn’t too surprising, as two of 
the VIPs are the Falkor’s benefactors: Google 
board chair Eric Schmidt and his wife Wendy. 
The 83-metre former fishery-protection ves- 
sel was retrofitted by the Schmidt Ocean 
Institute of Palo Alto, California, at a cost of 


Le onboard the R/V Falkor is a far cry 


US$60 million. Supporters say that it adds a 
new capability to US oceanography’s fraying 
infrastructure: an ability to take on riskier pro- 
jects more quickly than federal agencies can. 
“Our hope is to accelerate the pace of 
research,’ says Wendy Schmidt. “It’s not sci- 
ence as usual.” The ship’s first full-length science 
cruise ends on 29 March when it returns to port 
in St Petersburg, Florida, after a three-week trip. 
For some oceanographers, the arrival of the 
Falkor — named after a lucky dragon in the film 
The NeverEnding Story — is a stroke of luck. 
The US research fleet is ageing and shrinking: 
down from 26 ships in the University-National 
Oceanographic Laboratory System in 1995 to 
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19 today. Federal funds for ship time are dwin- 
dling as well. For the first time in 15 years, for 
example, microbiologist Julie Huber of the 
Marine Biological Laboratory in Woods Hole, 
Massachusetts, is relying entirely on private 
money for her studies. A slot on the Falkor in 
September will allow her to test a microbial 
sampler on the sea floor — technology devel- 
opment that she says would have progressed 
more slowly with incremental federal grants. 

There is a long history of millionaires dab- 
bling in oceanography without doing much 
publicly available science. Hollywood director 
James Cameron has a small group of research- 
ers advising him on his deep-sea dives, such as 
his record-breaking trip last year to the Mariana 
Trench. He is donating the submersible used 
on that dive to the Woods Hole Oceanographic 
Institution (WHOD) in Massachusetts, although 
he is holding back the discoveries from that 
plunge for use in a feature film. 

The Schmidt Ocean Institute, established 
in 2009, aims to reach a wider array of sci- 
entists. A parallel set-up, the Marine Science 
and Technology Foundation, also headed by 
Eric Schmidt, supports the development of 
oceanography tools, such as a video recorder 
for plankton and a surface-water analyser 
powered by wind and solar energy. “They 


are accelerating change in a field which was 
already changing gradually,’ says Kim Juniper, 
an oceanographer at the University of Victoria 
in Canada, who will lead a Falkor cruise around 
Vancouver Island in August to study waters 
with low oxygen levels. 

The Falkor can also carry equipment from 
other organizations. In 2014, for instance, 
it will take the WHOI’s deep-diving Nereus 
robot to the Mariana Trench. 

Given the ship’s Internet ties, it is no surprise 
that managers favour projects emphasizing 
open data. Charles Paull, a marine geologist at 
the Monterey Bay Aquarium Research Insti- 
tute in Moss Landing, California, is leading the 
current cruise in the Gulf of Mexico, where his 
team is mapping an underwater limestone cliff 
that bears marks from the meteorite impact 65 
million years ago that is thought to have killed 
the dinosaurs off. Paull plans to post his maps 
on Google Earth within two months — not the 
two years that most oceanographic data take 
to trickle out. 

Data sharing is nota requirement for research 
proposals, says Victor Zykov, Schmidt Ocean's 
director of research, but those that include an 
open-data component are ranked higher than 
those of equal scientific value that do not. 

Falkor cruises are scheduled after the 
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proposals have been peer reviewed by inde- 
pendent scientists. For the round of cruises 
beginning in 2014, the 48 applications made 
were whittled down to seven projects. Sixty- 
one applications have arrived for the cruises 
beginning in 2015, with a final selection of 
seven or so expected late this summer, says 
Zykov. “It's hard to deny that interest is grow- 
ing,” he says. Wendy Schmidt adds that if the 
cruises turn out good science, the institute may 
consider adding a second ship in the future. 

One catch is that although ship time is pro- 
vided for free, researchers must find a way to 
fund their salaries and any post-cruise science. 
For some this isn’t too much of a problem. 
Chris German at the WHOI, for instance, used 
leftover NASA money from an earlier cruise to 
pay for his portion of a Falkor trip this summer 
to the Caribbean, where he will use Nereus to 
hunt for new hydrothermal vents. 

Just about the only other US ship allowed 
such unfettered exploration is the US National 
Oceanic and Atmospheric Administration's 
Okeanos Explorer, German says. But the Falkor, 
with fewer education and outreach obligations, 
can operate with more focus. “It’s a new way of 
doing business,” he says. “The model is theo- 
retically good, but how it’s going to work out 
remains to be seen.” mSEE EDITORIAL P.410 


Sham journals scam authors 


Con artists are stealing the identities of real journals to cheat scientists out of publishing fees. 


BY DECLAN BUTLER 


S cientific publishing, meet cybercrime. 


Two reputable European science jour- 

nals have fallen prey to identity theft by 
criminals who have created counterfeit jour- 
nal websites. These online doppelgangers 
have duped hundreds of researchers into pay- 
ing author fees, with the ill-won gains being 
funnelled to Armenia. 

Editors of the victim journals first learned 
of the scam last year, but their attempts to 
put a stop to it have so far come to nothing. 
The crooked websites are masquerading as 
Archives des Sciences, a multidisciplinary 
journal founded in 1791 and published 
by the Society of Physics and Natural His- 
tory of Geneva (SPHN) in Switzerland; and 
Wulfenia, a botany journal published by the 
Regional Museum of Carinthia in Klagenfurt, 
Austria. 

The scammers attend to the closest of 
details, displaying on multiple websites not 
only the titles of the authentic journals, but 
also their impact factors, postal addresses and 


international standard serial numbers — the 
unique codes used to identify journals. 

Editors of the authentic publications fear 
that the ruse has tainted the reputations of 
their journals. 

“Victims are regularly contacting me to 
ask about the status of their papers: they 
transfer money and don’t see their papers 
published,” says Roland Eberwein, editor-in- 
chief of the authentic Wulfenia and head of 
the Botanic Center at the Carinthia museum, 
which includes a herbarium of more than 
200,000 specimens. 

“We are currently wasting our time trying to 
fight these people,” says Robert Degli Agosti, 
editor-in-chief of Archives des Sciences and a 
plant biologist and electrophysiologist at the 
University of Geneva. 

Neither of the authentic journals has its own 
dedicated website, making them easy prey for 
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imposters. In response to the scam, however, 
the SPHN and the Carinthia museum have 
put warning notices on their home pages, and 
Wulfenia has started publishing its back issues 
online. 

The forged sites look so convincing that they 
initially misled Thomson Reuters, a metrics 
company based in New York that produces the 
Scientific Citation Index and compiles journal 
impact factors. 

But by May last year, the company had 
become suspicious, writing to the SPHN for 
an explanation of the “huge discrepancy” 
between the content of articles in print issues 
of Archives des Sciences — which Thomson 
Reuters indexes — and on the website. It noted, 
too, a discrepancy in publishing frequency: 
“We receive and index 2 issues of each volume 
for each year, while the website is now listing 
12 issues per volume, one each month,’ it wrote 
to the society. 

One of the imposters had even persuaded 
Thomson Reuters to include a link to the 
false journal in its list of indexed publica- 
tions; the company moved swiftly to > 
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remove the link when the scam was 
uncovered. The action triggered “a bar- 
rage of complaints and requests to reac- 
tivate the link from representatives of 
the false journal”, says Marie McVeigh, 
director of content selection at Thomson 
Reuters. She says that the company has 
also received enquiries from customers 
“asking why the articles that had been 
accepted by one of the false journals were 
not appearing in our indexes”. 

“The quality and integrity of our content 
is of the greatest importance to us,’ adds 
McVeigh. 

Ina further impudent touch, the various 
counterfeit Archives des Sciences websites 
list an editorial board with 87 members, 
including Daniel Gamelin, a chemist and 
materials scientist at the University of 
Washington in Seattle, and Gerald Cleaver, 
a high-energy physicist at Baylor Univer- 
sity in Waco, Texas. Both are perplexed — 
and annoyed. “This is the first I have heard 
of this website or of my listing; I have no 
affiliation with this organization, nor have 
I ever,’ says Gamelin. Cleaver, too, says 
that his name is being used without his 
permission. 

The ‘editor-in-chief’ of the fake Archives 
des Sciences journal is named on the coun- 
terfeit websites as “Prof. Dr. Eliana Schmid’, 
with the affiliation “Geneva, Switzerland”. 
The counterfeit Wulfenia sites give as the 
editor-in-chief Vienna S. Franz and list 
35 editorial-board members, with most 
affiliations giving only city and country. 
Eberwein and Degli Agosti think that these 
named editors-in-chief are fictional. 

Researchers who have submitted to the 
fake journals pay dearly. Both the coun- 
terfeit Archives des Sciences and the fake 
Wulfenia charge author fees of more than 
$500, with instructions to address pay- 
ment to accounts at two banks in Yerevan, 
Armenia. 

Degli Agosti has reported the counterfeit 
Archives des Sciences websites to the Cyber- 
crime Coordination Unit Switzerland, but 
was told that the sites were hosted in the 
United States, so the unit could not act 
directly against them. However, it advised 
him that he could press criminal charges 
under Swiss cybercrime laws. The Uni- 
versity of Geneva’s lawyers are helping the 
SPHN to draft its case, but the society is not 
part of the university, so it will have to take 
its case forward alone. 

Austrian police have made little progress 
on the Wulfenia case, says Eberwein: they 
closed down a fake site hosted in Austria, 
but multiple replicas popped up on serv- 
ers in other countries. Austrian authorities 
have told Eberwein that the scope for legal 
redress is limited, but, unconvinced, he has 
contacted Archives des Sciences to swap 
notes. = 
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Physicists have found Higgs-like particles in a superfluid at the Max Planck Institute in Munich, Germany. 


Higgs physics 
on the cheap 


Tabletop Higgs particles may illuminate cosmic cousin. 


BY EUGENIE SAMUEL REICH 


ore than three decades ago, before 
Me: world’s most powerful parti- 

cle collider was even on the draw- 
ing board, two physicists discovered a Higgs 
boson. Ona tabletop. 

In 1981, Peter Littlewood and Chandra 
Varma, two solid-state theorists at Bell Labo- 
ratories in Murray Hill, New Jersey, realized 
that a mysterious effect seen in a niobium 
selenide superconductor could be explained 
by the jiggling of the invisible field that causes 
electrons in the material to pair up and move 
as one without resistance. Mathematically, the 
disturbance in the field looked very like one 
that is associated with the Higgs particle found 
by particle physicists. 

Because the superconducting field was 
already known, the search for the associated 
particle never drew as much attention as the 
hunt by particle physicists for the Higgs. That 
quest culminated on 13 March at the Large 
Hadron Collider (LHC) at CERN, Europe’s 
main particle-physics facility near Geneva 


2013 
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in Switzerland, with the announcement of 
a definitive Higgs detection: the first direct 
evidence of a Higgs field permeating the Uni- 
verse and giving objects mass. 

But physicists agree that the superconduct- 
ing Higgs is closely related to its particle- 
physics cousin. Both arise from the vibration 
of an invisible field that forces ordinary par- 
ticles to oscillate in sync. Now physicists are 
seeing signs that other condensed-matter sys- 
tems can generate Higgs-like particles, raising 
hopes that work on one Higgs, studied cheaply 
on tabletops, can inform the study of another, 
probed bya US$5-billion collider. “Tm hoping 
there will be cross-fertilization, says Varma, 
now at the University of California, Riverside, 
who spoke in a packed session on solid-state 
Higgs particles at a meeting of the American 
Physical Society (APS) in Baltimore, Mary- 

land, on 19 March. 


It wouldnt be the first 
For CERN’s time that particle phys- 
discovery of the ics stood on the shoul- 
Higgs, see: der of condensed-matter 


physics. When Peter 
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HIGGS HUNTING 


Energy scale: 1.25 x 10" eV 
Permeates the Universe 

and gives rise to mass in 
other particles. 


Energy scale: 0.002 eV 
Exists as a jiggling in the field 

..@ describing how superconducting 
electrons pair up. 


eV, electronvolt. 


Higgs, a theorist at the University of Edin- 
burgh, UK, put together the idea known as the 
Higgs field in 1964, he built on a theory devel- 
oped a year earlier, by theorist Phil Anderson, 
now at Princeton University in New Jersey, to 
describe the interactions of superconduct- 
ing electrons. Higgs took the idea further, 
interpreting the field as a medium in empty 
space that would pull on particles, giving them 
mass. The Higgs bosons that made up the field 
would be invisible unless the field was jiggled 
in the right way, as occurs in high-energy col- 
lisions at the LHC. 

Varma and Littlewood identified their 
superconducting Higgs in the same way, by 
recognizing that a wave of slightly displaced 
atoms was jiggling the superconducting field. 
Now researchers are trying to extend their 
catalogue of Higgs particles. At the APS meet- 
ing, physicist Marie- Aude Méasson from the 
University of Paris Diderot and her colleagues 
reported that the mechanism by which the 
Higgs was recognized in niobium selenide 
did not exist in a related superconductor, nio- 
bium sulphide, which lacks the wave of dis- 
placed atoms. Méasson says that she wants to 
find examples of other superconductors that 
can produce Higgs particles. “When you only 
have one example in nature you wonder if it’s 
really what you think,” she says. 

Other researchers are looking for examples 
outside superconductors. Christian Riiegg of 
the Paul Scherrer Institute in Villigen, Switzer- 
land, says that his research group has produced 
Higgs-like particles in an antiferromagnet, in 
which the spins of atoms are opposite to adja- 
cent ones. And physicist Immanuel Bloch of 
the Ludwig Maximilians University in Munich, 
Germany, told scientists at the APS meeting 
that he has created Higgs particles in a cloud 
of chilled rubidium atoms that share the same 
quantum state (a Bose-Einstein condensate). 
Using criss-crossing laser beams, Bloch tunes 
the cloud to a transition point between the 
atoms’ insulating and superfluid states at which 
the Higgs particle emerges. Although the 
speed of light is a factor in equations describ- 
ing the particle-physics Higgs, it is replaced by 
the speed of sound in Bloch’s system, which 
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Physicists are looking for connections between the 
cosmic Higgs boson, discovered in a particle collider, 
and its tabletop cousins. 


Energy scale: 4 x 10° eV 
Exists as a jiggling in the field 
describing the shared quantum 
state of a cloud of atoms. 


Energy scale: Up to 0.0012 eV 
Exists as a jiggling in the 
magnetic ordering of atomic 
spin states. 


means that the energy scale of the particles is 
vastly smaller than that of that of the cosmic 
Higgs (see ‘Higgs hunting’). This mini-Higgs, 
says Bloch, is “a scaled-down version”. 

Research on such solid-state Higgs particles 
might illuminate future directions for particle 
physics, says Subir Sachdev, a physicist at Har- 
vard University in Cambridge, Massachusetts, 
who organized the APS session. A major disap- 
pointment of the Higgs discovered at the LHC 
is its apparently ‘plain vanilla nature. Theo- 
rists had hoped for a composite Higgs that 
would have pointed to the existence of other 
undiscovered particles and forces. But so far 
the Higgs has behaved exactly as expected, 
revealing little about what might lie beyond the 
standard model of particle physics, thought to 
be incomplete because it doesn’t include the 
force of gravity. One obstacle at the LHC is that 
any undiscovered particles might decay almost 
immediately after they are created, too quickly 
for them to be studied. But Bloch’s version of 
a Higgs particle lasts for one six-hundredth of 
a second, which allows him to closely observe 
details of particle decays. 

Particle theorist John Ellis of CERN agrees 
that the study of solid-state Higgs particles 
might shed light on what lies beyond the stand- 
ard model. But he notes that ideas can flow in 
the other direction, too: the mathematical tools 
developed by particle physicists to study strong 
interactions between particles are already 
being used by condensed-matter physicists 
in the hope of explaining high-temperature 
superconductors, in which some think that 
yet more Higgs particles might be lurking. 
“Maybe particle physics has given something 
back,” says Ellis. m 


CORRECTION 

The News Feature ‘Campus 2.0’ (Nature 
495, 160-163; 2013) wrongly said that 
Andrew Ng went public with Daphne 
Koller’s machine-learning course — it was 
his course. Both this course and a database 
course each attracted 100,000 students not 
60,000 as stated. 
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A NEW 
PAGE 


A special issue of 
Nature looks at the 


transformation taking 


place in scientific 
publishing. 


BRENDAN MONROE 


fter 350 years in the slow-moving world 
Ae print, scientific publishing has been 

thrust into a fast-paced online realm of 
cloud computing and ubiquitous sharing. The 
result has been an era of ferment, as estab- 
lished practices are challenged by new ones — 
most notably, the open-access model in which 
the author pays publication fees upfront. Last 
month, US President Barack Obama’s admin- 
istration declared that government-funded 
research papers should be made freely avail- 
able within 12 months of publication (see 
Nature 494, 414-415; 2013). And from 1 April, 
research councils in the United Kingdom will 
require the results of government-funded 
research to be open access on publication. 

In this special issue, Nature explores the 
changing landscape. A News Feature weighs 
claims that online, author-pays publishing can 
drastically cut costs (see page 426). Several 
authors discuss the nuts and bolts of making 
open-access publishing work well — includ- 
ing copyright pioneer John Wilbanks on open 
licensing agreements (see pages 440 and 442). 
A report explores the dark side of open access: 
publishers whose tactics lead authors to feel 
disgruntled or duped (see page 433). Anda 
Careers Feature offers advice for researchers 
trying to balance prestige, cost and career 
implications in deciding where to submit 
manuscripts (see page 539). 

The special also looks at broader aspects of 
publishing. Information scientist Jason Priem 
describes how the concepts of journal and 
article are being superseded by algorithms 
that filter, rate and disseminate scholarship 
as it happens (see page 437). A News Fea- 
ture investigates how some university libraries 
are reinventing themselves to help scientists 
to archive and make accessible a new kind 
of publication: data sets (see page 430). And 
Robert Darnton, director of the library at Har- 
vard University in Cambridge, Massachusetts, 
talks about the soon-to-be launched Digital 
Public Library of America, which could ulti- 
mately hold 5 million books (see page 447). 
Science itself is changing rapidly; the means 
by which it is shared must keep up. = 


THE FUTURE OF PUBLISHING 


A Nature special issue. 
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THE TRUE COST OF 
SCIENCE PUBLISHING 


Cheap open-access journals raise questions about the 
value publishers add for their money. 


BY RICHARD VAN NOORDEN 


ichael Eisen doesnt hold back when invited to vent. “It’s still 

ludicrous how much it costs to publish research — let alone 

what we pay,’ he declares. The biggest travesty, he says, is 

that the scientific community carries out peer review — a 

major part of scholarly publishing — for free, yet subscription-journal 

publishers charge billions of dollars per year, all told, for scientists to 
read the final product. “It’s a ridiculous transaction,” he says. 

Eisen, a molecular biologist at the University of California, Berkeley, 

argues that scientists can get much better value 


journals published by the Public Library of Science (PLoS), which Eisen 
co-founded in 2000. “The costs of research publishing can be much 
lower than people think,’ agrees Peter Binfield, co-founder of one of the 
newest open-access journals, Peer], and formerly a publisher at PLoS. 
But publishers of subscription journals insist that such views are 
misguided — born ofa failure to appreciate the value they add to 
the papers they publish, and to the research community as a whole. 
They say that their commercial operations are in fact quite efficient, 
so that if a switch to open-access publishing 


by publishing in open-access journals, which 
make articles free for everyone to read and 
which recoup their costs by charging authors or 
funders. Among the best-known examples are 


THE FUTURE OF PUBLISHING 


A Nature special issue. 
nature.com/scipublishing 


led scientists to drive down fees by choosing 
cheaper journals, it would undermine impor- 
tant values such as editorial quality. 

These charges and counter-charges have 
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been volleyed back and forth since the open-access idea emerged in 
the 1990s, but because the industry's finances are largely mysterious, 
evidence to back up either side has been lacking. Although journal list 
prices have been rising faster than inflation, the prices that campus 
libraries actually pay to buy journals are generally hidden by the non- 
disclosure agreements that they sign. And the true costs that publishers 
incur to produce their journals are not widely known. 

The past few years have seen a change, however. The number of 
open-access journals has risen steadily, in part because of funders’ 
views that papers based on publicly funded 
research should be free for anyone to read. 
By 2011, 11% of the world’s articles were 
being published in fully open-access jour- 
nals' (see “The rise of open access’). Sud- 
denly, scientists can compare between 
different publishing prices. A paper that 
costs US$5,000 for an author to publish in 
Cell Reports, for example, might cost just 
$1,350 to publish in PLoS ONE — whereas 
Peer] offers to publish an unlimited num- 
ber of papers per author for a one-time 
fee of $299. “For the first time, the author can evaluate the service 
that they're getting for the fee they're paying,” says Heather Joseph, 
executive director of the Scholarly Publishing and Academic Resources 
Coalition in Washington DC. 

The variance in prices is leading everyone involved to question the 
academic publishing establishment as never before. For researchers and 
funders, the issue is how much of their scant resources need to be spent 
on publishing, and what form that publishing will take. For publish- 
ers, itis whether their current business models are sustainable — and 
whether highly selective, expensive journals can survive and prosper 
in an open-access world. 


THE COST OF PUBLISHING 

Data from the consulting firm Outsell in Burlingame, California, sug- 
gest that the science-publishing industry generated $9.4 billion in reve- 
nue in 2011 and published around 1.8 million English-language articles 
— an average revenue per article of roughly $5,000. Analysts estimate 
profit margins at 20-30% for the industry, so the average cost to the 
publisher of producing an article is likely to be around $3,500-4,000. 

Most open-access publishers charge fees that are much lower than 
the industry’s average revenue, although there is a wide scatter between 
journals. The largest open-access publishers — BioMed Central and 
PLoS — charge $1,350-2,250 to publish peer-reviewed articles in many 
of their journals, although their most selective offerings charge $2,700- 
2,900. In a survey published last year”, economists Bo-Christer Bjork 
of the Hanken School of Economics in Helsinki and David Solomon 
of Michigan State University in East Lansing looked at 100,697 articles 
published in 1,370 fee-charging open-access journals active in 2010 
(about 40% of the fully open-access articles in that year), and found that 
charges ranged from $8 to $3,900. Higher charges tend to be found in 
‘hybrid’ journals, in which publishers offer to make individual articles 
free in a publication that is otherwise paywalled (see ‘Price of prestige’). 
Outsell estimates that the average per-article charge for open-access 
publishers in 2011 was $660. 

Although these fees seem refreshingly transparent, they are not the 
only way that open-access publishers can make money. As Outsell notes, 
the $660 average, for example, does not represent the real revenue col- 
lected per paper: it includes papers published at discounted or waived 
fees, and does not count cash from the membership schemes that some 
open-access publishers run in addition to charging for articles. Fre- 
quently, small open-access publishers are also subsidized, with univer- 
sities or societies covering the costs of server hosting, computers and 
building space. That explains why many journals say that they can offer 
open access for nothing. One example is Acta Palaeontologica Polonica, 
a respected open-access palaeontology journal, the costs of which are 
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mostly covered by government subsidies to the Institute of Paleobiology 
of the Polish Academy of Sciences in Warsaw; it charges nothing for 
papers under 10 pages. Another is eLife, which is covered by grants from 
the Wellcome Trust in London; the Max Planck Society in Munich, 
Germany; and the Howard Hughes Medical Institute in Chevy Chase, 
Maryland. And some publishers use sets of journals to cross-subsidize 
each other: for example, PLoS Biology and PLoS Medicine receive subsidy 
from PLoS ONE, says Damian Pattinson, editorial director at PLoS ONE. 

Neither PLoS nor BioMed Central would discuss actual costs 
(although both organizations are profit- 
able as a whole), but some emerging play- 
ers who did reveal them for this article say 
that their real internal costs are extremely 
low. Paul Peters, president of the Open 
Access Scholarly Publishing Association 
and chief strategy officer at the open-access 
N publisher Hindawi in Cairo, says that last 

year, his group published 22,000 articles 

at a cost of $290 per article. Brian Hole, 

founder and director of the researcher-led 

Ubiquity Press in London, says that aver- 
age costs are £200 (US$300). And Binfield says that Peer]’s costs are in 
the “low hundreds of dollars” per article. 

The picture is also mixed for subscription publishers, many of which 
generate revenue froma variety of sources — libraries, advertisers, 
commercial subscribers, author charges, reprint orders and cross- 
subsidies from more profitable journals. But they are even less trans- 
parent about their costs than their open-access counterparts. Most 
declined to reveal prices or costs when interviewed for this article. 

The few numbers that are available show that costs vary widely in 
this sector, too. For example, Diane Sullenberger, executive editor for 
Proceedings of the National Academy of Sciences in Washington DC, 
says that the journal would need to charge about $3,700 per paper 
to cover costs if it went open-access. But Philip Campbell, editor-in- 
chief of Nature, estimates his journal’s internal costs at £20,000-30,000 
($30,000-40,000) per paper. Many publishers say they cannot estimate 
what their per-paper costs are because article publishing is entangled 
with other activities. (Science, for example, says that it cannot break 
down its per-paper costs; and that subscriptions also pay for activities 
of the journal's society, the American Association for the Advancement 
of Science in Washington DC.) 

Scientists pondering why some publishers run more expensive out- 
fits than others often point to profit margins. Reliable numbers are hard 
to come by: Wiley, for example, used to report 40% in profits from its 
science, technology and mathematics (STM) publishing division before 
tax, but its 2013 accounts noted that allocating to science publishing 
a proportion of ‘shared services’ — costs of distribution, technology, 
building rents and electricity rates — would halve the reported profits. 
Elsevier's reported margins are 37%, but financial analysts estimate 
them at 40-50% for the STM publishing division before tax. (Nature 
says that it will not disclose information on margins.) Profits can be 
made on the open-access side too: Hindawi made 50% profit on the 
articles it published last year, says Peters. 

Commercial publishers are widely acknowledged to make larger 
profits than organizations run by academic institutions. A 2008 study 
by London-based Cambridge Economic Policy Associates estimated 
margins at 20% for society publishers, 25% for university publish- 
ers and 35% for commercial publishers’. This is an irritant for many 
researchers, says Deborah Shorley, scholarly communications adviser 
at Imperial College London — not so much because commercial prof- 
its are larger, but because the money goes to shareholders rather than 
being ploughed back into science or education. 

But the difference in profit margins explains only a small part of the 
variance in per-paper prices. One reason that open-access publishers 
have lower costs is simply that they are newer, and publish entirely 
online, so they don’t have to do print runs or set up subscription 
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paywalls (see “How costs break down’). Whereas small start-ups can 
come up with fresh workflows using the latest electronic tools, some 
established publishers are still dealing with antiquated workflows for 
arranging peer review, typesetting, file-format conversion and other 
chores. Still, most older publishers are investing heavily in technology, 
and should catch up eventually. 


THE COST OF PUBLISHING 


JOURNAL PRICES VARY WITH INFLUENCE AND BUSINESS MODEL. 


Price of prestige 
Open-access prices correlate weakly with the 
average influence of a journal’s articles. 


COSTLY FUNCTIONS 

The publishers of expensive journals give two other explanations for 
their high costs, although both have come under heavy fire from advo- 
cates of cheaper business models: they do more and they tend to be 
more selective. The more effort a publisher invests in each paper, and 
the more articles a journal rejects after peer review, the more costly is 
each accepted article to publish. 

Publishers may administer the peer-review process, which includes 
activities such as finding peer reviewers, evaluating the assessments and 
checking manuscripts for plagiarism. They may edit the articles, which 
includes proofreading, typesetting, adding graphics, turning the file into 
standard formats such as XML and adding metadata to agreed indus- 
try standards. And they may distribute print copies and host journals 
online. Some subscription journals have a large staff of full-time editors, 
designers and computer specialists. But not every publisher ticks all 
the boxes on this list, puts in the same effort or hires costly professional 
staff for all these activities. For example, most of PLoS ONE'’s editors are 
working scientists, and the journal does not perform functions such as 
copy-editing. Some journals, including Nature, also generate additional 
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Chart omits open-access journals yet to receive an Article Influence® score. 


*Subscription journals that give option of open-access publishing. {Relative score, in which 1 = global average. See 
www.eigenfactor.org/openaccess for details on how this metric is calculated. 
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How costs break down 


An economic model s| 
open access changes 


hows how switching from subscription to 
the costs of publishing. 


content for readers, such as editorials, commentary articles and journal- 
ism (including the article you are reading). “We get positive feedback 
about our editorial process, so in our experience, many scientists do 
understand and appreciate the value that this adds to their paper,’ says 
David Hoole, marketing director at Nature Publishing Group. 


i = ussi00 ans The key question is whether the extra effort adds useful value, says 
of typesetting Timothy Gowers, a mathematician at the University of Cambridge, 
Subscription and printing Shae UK, who last year led a revolt against Elsevier (see Nature http://doi. 
ene malniieieiien org/kwd; 2012). Would scientists’ appreciation for subscription jour- 
: Subscription and user nals hold up if costs were paid for by the authors, rather than spread 
ONLINE ONLY on aee among subscribers? “If you see it from the perspective of the publisher, 
($3,509) you may feel quite hurt,” says Gowers. “You may feel that a lot of work 
Open access you put in is not really appreciated by scientists. The real question is 

ONLINE ONLY whether that work is needed, and that’s much less obvious.” 
| fy | ft | ($2,289) Many researchers in fields such as mathematics, high-energy 
Bee se physics and computer science do not think it is. They post pre- and 
HoAboww NS post-reviewed versions of their work on servers such as arXiv — an 
Shane eae operation that costs some $800,000 a year to keep going, or about $10 
HEE BREE per article. Under a scheme of free open-access “Episciences journals 
Bee oot | TT | in proposed by some mathematicians this January, researchers would 
rT Tt rT TTT TT TTT organize their own system of community peer review and host research 
TT TTT =H rt tdi on arXiv, making it open for all at minimal cost (see Nature http://doi. 
7 7 = = — = — 7 — — = = = = _ A es suit communities that have a culture of sharing 


Voluntary peer review (not counted in price) 


Additional cost if reviewers were paid for their time. 


preprints, and that either produce theoretical work or see high scrutiny 
of their experimental work — so it is effectively peer reviewed before it 
even gets submitted to a publisher. But they find less support elsewhere 
— in the highly competitive biomedical fields, for instance, researchers 


Hi Article processing : ; : 
Administering peer review (assuming average rejection rate of 50%); tend not to publish preprints for fear of being scooped and they place 
editing; proofreading; typesetting; graphics; quality assurance. more value on formal (journal-based) peer review. “If we have learned 

i) Other costs hima anything in the open-access movement, it’s that not all scientific com- 
Covers, indexes and editorial; rights management; sales and payments; iti ted th : ized + fit all” h 
printing and delivery; online user management; marketing and munities are created the same: one size doesnt fit all,” says Joseph. 
communications; helpdesk; online hosting. 

_ Management and investment THE VALUE OF REJECTION 


Data from J. Houghton et al. Economic implications of alternative scholarly publishing models (Joint Information 


Includes cost to establish journal: assumed 20% subscription; 


15% open access. 


Margin 


Assumed 20% subscription; 15% open access. 


Systems Committee, 2009). available at go.nature.com/uqrxqw. 
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Tied into the varying costs of journals is the number of articles that 
they reject. PLoS ONE (which charges authors $1,350) publishes 70% 
of submitted articles, whereas Physical Review Letters (a hybrid journal 
that has an optional open-access charge of $2,700) publishes fewer 
than 35%; Nature published just 8% in 2011. 

The connection between price and selectivity reflects the fact 
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that journals have functions that go beyond just publishing articles, 
points out John Houghton, an economist at Victoria University in 
Melbourne, Australia. By rejecting papers at the peer-review stage 
on grounds other than scientific validity, and so guiding the papers 
into the most appropriate journals, publishers filter the literature and 
provide signals of prestige to guide readers’ attention. Such guidance 
is essential for researchers struggling to identify which of the millions 
of articles published each year are worth looking at, publishers argue 
— and the cost includes this service. 

A more-expensive, more-selective journal should, in principle, 
generate greater prestige and impact. Yet in the open-access world, 
the higher-charging journals don’t reliably command the greatest 
citation-based influence, argues Jevin West, a biologist at the Uni- 
versity of Washington in Seattle. Earlier this 
year, West released a free tool that researchers 
can use to evaluate the cost-effectiveness of 
open-access journals (see Nature http://doi. 
org/kwh; 2013). 

And to Eisen, the idea that research is fil- 


tered into branded journals before it is pub- Mi Immediate OA Hl Delayed OA [lj Non-OA 


Millions of articles: 


lished is not a feature but a bug: a wasteful 
hangover from the days of print. Rather than 2.0 
guiding articles into journal ‘buckets’, he 
suggests, they could be filtered after publi- 
cation using metrics such as downloads and 4 
citations, which focus not on the antiquated 
journal, but on the article itself (see page 437). 

Alicia Wise, from Elsevier, doubts that this 
could replace the current system: “I don't 
think it’s appropriate to say that filtering and 0.8 
selection should only be done by the research 


1.8 


ie) 


fo) 


community after publication,’ she says. She ve 
argues that the brands, and accompanying fil- 04 
ters, that publishers create by selective peer 


review add real value, and would be missed if 0.2 
removed entirely. 

PLoS ONE supporters have a ready answer: 
start by making any core text that passes peer 
review for scientific validity alone open to 
everyone; if scientists do miss the guidance 
of selective peer review, then they can use recommendation tools and 
filters (perhaps even commercial ones) to organize the literature — 
but at least the costs will not be baked into pre-publication charges. 

These arguments, Houghton says, are a reminder that publishers, 
researchers, libraries and funders exist in a complex, interdependent 
system. His analyses, and those by Cambridge Economic Policy Asso- 
ciates, suggest that converting the entire publishing system to open 
access would be worthwhile even if per-article-costs remained the 
same — simply because of the time that researchers would save when 
trying to access or read papers that were no longer lodged behind 


paywalls. 
THE PATH TO OPEN ACCESS 


But a total conversion will be slow in coming, because scientists still 
have every economic incentive to submit their papers to high-prestige 
subscription journals. The subscriptions tend to be paid for by campus 
libraries, and few individual scientists see the costs directly. From their 
perspective, publication is effectively free. 

Of course, many researchers have been swayed by the ethical argu- 
ment, made so forcefully by open-access advocates, that publicly 
funded research should be freely available to everyone. Another impor- 
tant reason that open-access journals have made headway is that librar- 
ies are maxed out on their budgets, says Mark McCabe, an economist 
at the University of Michigan in Ann Arbor. With no more library cash 
available to spend on subscriptions, adopting an open-access model 
was the only way for fresh journals to break into the market. New 
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THE RISE OF OPEN ACCESS 


Immediate open access (OA) made 
up 12% of articles in 2011*. 
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funding-agency mandates for immediate open access could speed the 
progress of open-access journals. But even then the economics of the 
industry remain unclear. Low article charges are likely to rise if more- 
selective journals choose to go open access. And some publishers warn 
that shifting the entire system to open access would also increase prices 
because journals would need to claim all their revenue from upfront 
payments, rather than from a variety of sources, such as secondary 
rights. “I've worked with medical journals where the revenue stream 
from secondary rights varies from less than 1% to as muchas one-third 
of total revenue,’ says David Crotty of Oxford University Press, UK. 

Some publishers may manage to lock in higher prices for their pre- 
mium products, or, following the successful example of PLoS, large 
open-access publishers may try to cross-subsidize high-prestige, 
selective, costly journals with cheaper, high- 
throughput journals. Publishers who put out 
a small number of articles in a few mid-range 
journals may be in trouble under the open- 
access model if they cannot quickly reduce 
costs. “In the end,” says Wim van der Stelt, 
executive vice president at Springer in Doet- 
inchem, the Netherlands, “the price is set by 
what the market wants to pay for it” 

In theory, an open-access market could 
drive down costs by encouraging authors to 
weigh the value of what they get against what 
they pay. But that might not happen: instead, 
funders and libraries may end up paying the 
costs of open-access publication in place of 
scientists — to simplify the accounting and 
maintain freedom of choice for academics. 
Joseph says that some institutional libraries 
are already joining publisher membership 
schemes in which they buy a number of free 
or discounted articles for their researchers. 
She worries that such behaviour might reduce 
the author’s awareness of the price being paid 
to publish — and thus the incentive to bring 
costs down. 

And although many see a switch to open 
access as inevitable, the transition will be 
gradual. In the United Kingdom, portions of grant money are being 
spent on open access, but libraries still need to pay for research pub- 
lished in subscription journals. In the meantime, some scientists are 
urging their colleagues to deposit any manuscripts they publish in 
subscription journals in free online repositories. More than 60% of 
journals already allow authors to self-archive content that has been 
peer-reviewed and accepted for publication, says Stevan Harnad, a vet- 
eran open-access campaigner and cognitive scientist at the University 
of Quebec in Montreal, Canada. Most of the others ask authors to wait 
for a time (say, a year), before they archive their papers. However, the 
vast majority of authors don't self-archive their manuscripts unless 
prompted by university or funder mandates. 

As that lack of enthusiasm demonstrates, the fundamental force driv- 
ing the speed of the move towards full open access is what research- 
ers — and research funders — want. Eisen says that although PLoS 
has become a success story — publishing 26,000 papers last year — it 
didn’t catalyse the industry to change in the way that he had hoped. “I 
didn’t expect publishers to give up their profits, but my frustration lies 
primarily with leaders of the science community for not recognizing 
that open access is a perfectly viable way to do publishing,” he says. m 


Richard Van Noorden is an assistant news editor at Nature. 
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THE DARK SIDE 
OF PUBLISHING 


The explosion in open-access publishing has 
fuelled the rise of questionable operators. 


pam e-mails changed the life of 
S Jeffrey Beall. It was 2008, and Beall, 

an academic librarian and a researcher at the University of 
Colorado in Denver, started to notice an increasing flow of messages 
from new journals soliciting him to submit articles or join their edi- 
torial boards. “I immediately became fascinated because most of the 
e-mails contained numerous grammatical errors,’ Beall says. He started 
browsing the journals’ websites, and was soon convinced that many of 
the journals and their publishers were not quite what they claimed. The 
names often sounded grand — adjectives such as ‘world, ‘global’ and 
‘international’ were common — but some sites looked amateurish or 
gave little information about the organization behind them. 

Since then, Beall has become a relentless watchdog for what he 
describes as “potential, possible or probable predatory scholarly open- 
access publishers’, listing and scrutinizing them on his blog, Scholarly 
Open Access. Open-access publishers often 


BY DECLAN BUTLER 


without providing all the expected publishing 
services. These publishers, Beall says, typically 
display “an intention to deceive authors and readers, and a lack of trans- 
parency in their operations and processes”. 

Beall says that he regularly receives e-mails from researchers unhappy 
about their experiences with some open-access journals. Some say that 
they thought their papers had been poorly peer reviewed or not peer 
reviewed at all, or that they found themselves listed as members of edi- 
torial boards they had not agreed to serve on. Others feel they were not 
informed clearly, when submitting papers to publishers, that publication 
would entail a fee — only to face an invoice after the paper had been 
accepted. According to Beall, whose list now includes more than 300 
publishers, collectively issuing thousands of journals, the problem is get- 
ting worse. “2012 was basically the year of the predatory publisher; that 
was when they really exploded, says Beall. He estimates that such outfits 
publish 5-10% of all open-access articles. 


collect fees from authors to pay for peer review, bgt 
editing and website maintenance. Beall asserts 
that the goal of predatory open-access publish- 


ers is to exploit this model by charging the fee 


THE FUTURE OF PUBLISHING 


A Nature special issue. 
nature.com/scipublishing 


Beall’s list and blog are widely read by librar- 
ians, researchers and open-access advocates, 
many of whom applaud his efforts to reveal 
shady publishing practices — ones that, they 
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worry, could taint the entire open- 
access movement. “I think Beall 
has taken a brave and principled 
stand in publishing this, at no 
small risk to himself’ says Douglas 
Sipp, an expert in science policy 
and ethics at the RIKEN Center for 
Developmental Biology in Kobe, 
Japan, who studies the open-access 
movement in Asia. 

Beall says that he has been the 
target of vicious online comments, 
and last December he was the 
subject of an online campaign to 
create the false impression that he 
was extorting fees from publishers 
to re-evaluate their status on his 
list. The Canadian Center of Sci- 
ence and Education, a company 
based in Toronto that publishes 
many open-access journals and is 
on Beall’s list, is now threatening 
to sue him for alleged defama- 
tion and libel. But even some 
experts in scholarly publishing 
are uncomfortable with Beall’s 
blacklist, arguing that it runs the 
risk of lumping publishers that are 
questionable together with those 
that could be bona fide start-ups 
simply lacking experience in the 
publishing industry. Matthew Cockerill, manag- 
ing director of BioMed Central, an open-access 
publisher based in London, says that Beall’s list 
“identifies publishers which Beall has concerns 
about. These concerns may or may not be justified” 


RISING TIDE 

As a research librarian, Beall has been in prime 
position to watch the dramatic changes that have 
taken place in scientific publishing since the rise 
of the open-access movement about a decade 
ago. In the conventional subscription-based 
model, journals bring in revenue largely through 
selling print or web subscriptions and keeping 
most online content locked behind a paywall. But 
in the most popular model of open access, pub- 
lishers charge an upfront ‘author fee’ to cover costs 
— and to turn a profit, in the case of commercial publishers — then 
make the papers freely available online, immediately on publication. 

The open-access movement has spawned many successful, well- 
respected operations. PLOS ONE, for example, which charges a fee of 
US$1,350 for authors in middle- and high-income countries, has seen 
the number of articles it publishes leap from 138 in 2006 to 23,464 last 
year, making it the world’s largest scientific journal. The movement has 
also garnered growing political support. In the past year, the UK and 
US governments, as well as the European Commission, have thrown 
their weight behind some form of open-access publishing. And scarcely 
a week goes by without the appearance of new author-pays, open-access 
publishers, launching single journals or large fleets of them. 

Many new open-access publishers are trustworthy. But not all. 
Anyone with a spare afternoon and a little computing savvy can launch 
an impressive-looking journal website and e-mail invitations to scien- 
tists to join editorial boards or submit papers for a fee. The challenge for 
researchers, and for Beall, is to work out when those websites or e-mail 
blasts signal a credible publisher and when they come from operations 
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Jeffrey Beall investigates potentially shady practices in open-access publishing. 


“2012 WAS THE 
YEAR OF THE 
PREDATORY 
PUBLISHER; 
THAT WAS 
WHEN THEY 
REALLY 
EXPLODED.” 


that can range from the outright 
criminal to the merely amateurish. 

In one e-mail that Beall received 
and shared with Nature, a dental 
researcher wrote that she had sub- 
mitted a paper to an open-access 
journal after she “was won over 
by the logos of affiliated databases 
on the home page and seemingly 
prestigious editorial board” But 
the researcher, who prefers to 
remain anonymous, says that she 
became concerned about the peer- 
review process when the article 
was accepted within days and she 
was not sent any reviewers’ com- 
ments. She says that last week — 
several months after her original 
submission — she was sent page 
proofs that match the submitted 
manuscript, and that she still has 
not seen reviewers’ comments. 

Complaints like this prompted 
Beall to coin the term predatory 
publisher and to compile his first 
list of them, which he published 
in 2010. He now estimates that 
his zeal for investigating publish- 
ers takes up 20-25 hours a week, 
squeezed in around his day job. 
Beall says that he is motivated 
partly by his sense of duty, as an academic librar- 
ian, to evaluate online resources and to help 
patrons to “recognize scholarly publishing scams 
and avoid them’, and partly by the “private and 
very positive feedback” he receives from research- 
ers and librarians. 

But Bealls critics assert that he often relies heavily 
on analysis of publishers’ websites rather than 
detailed discussions with publishers, and that this 
might lead to incorrect or premature conclusions. 
“One of the major weaknesses of Jeffrey Beall’s 
methodology is that he does not typically engage in 
direct communication with the journals that he has 
classified as predatory,’ says Paul Peters, chief strat- 
egy officer at Hindawi Publishing Corporation, 
based in Cairo, and president of the Open Access 
Scholarly Publishers Association (OASPA), based. 
in The Hague, the Netherlands. A set of Hindawi’s journals appeared 
ona version of Beall’s list because he had concerns about their editorial 
process, but has since been removed. “I reanalysed it and determined that 
it did not belong on the list,” he says. “It was always a borderline case.” 

Another concern, say Beall’s critics, is that he risks throwing undue 
suspicion on start-up publishers. “Although rapid launches of many 
journals may well correlate negatively with journal quality, it is certainly 
not enough in and of itself to warrant describing a publisher as preda- 
tory, says Cockerill, who is also a board member of the OASPA. 
“Similarly, some publishers identified on Beall’s list are guilty of poor 
copy-editing and user-interface design on their websites,” he says. 
“Again, this is, at best, circumstantial evidence for problems with the 
scholarly standard of the material they publish.” 

OMICS Group, based in Hyderabad, India, is on Beall’s list. One 
researcher complained in an e-mail to Beall that she had submitted 
a paper to an OMICS journal after receiving an e-mail solicitation — 
but learned that she had to pay a fee to publish it only from a message 
sent by the journal after the paper had been accepted. “To my horror, 
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I opened the file to find an 
invoice for $2,700!” she wrote. 
“This fee was not mentioned 
anywhere obvious at the time 
I submitted my manuscript.” 
(Nature was unable to contact 
this researcher.) Beall says that 
OMICS journals do not show their 
author fees prominently enough 
on their journal websites or in 
e-mails that they send to authors 
to solicit manuscript submissions. 

Srinubabu Gedela, director of 
OMICS Group, says that article- 
handling fees are displayed clearly 
on the ‘Instructions for Authors’ 
web page for each OMICS jour- 
nal. Gedela adds that he would 
assume researchers would be 
aware that such open-access 
journals charge author fees. He 
says that OMICS Group is “not 
predatory” and that its staff and 
editors are acting in “good faith 
and confidence” to promote open- 
access publishing. 

Publishers in developing count- 
ries and emerging economies are 


BUYER BEWARE 


How to perform due diligence before submitting to a journal or 
publisher. 


@ Check that the publisher provides full, verifiable contact 
information, including address, on the journal site. Be cautious of 
those that provide only web contact forms. 

@ Check that a journal’s editorial board lists recognized experts 
with full affiliations. Contact some of them and ask about their 
experience with the journal or publisher. 

@ Check that the journal prominently displays its policy for author 
fees. 

@ Be wary of e-mail invitations to submit to journals or to become 
editorial board members. 

@ Read some of the journal’s published articles and assess their 
quality. Contact past authors to ask about their experience. 

@ Check that a journal’s peer-review process is clearly described 
and try to confirm that a claimed impact factor is correct. 

@ Find out whether the journal is a member of an industry 
association that vets its members, such as the Directory of Open 
Access Journals (www.doaj.org) or the Open Access Scholarly 
Publishers Association (www.oaspa.org). 

@ Use common sense, as you would when shopping online: if 
something looks fishy, proceed with caution. D.B. 
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the papers so far has entailed 

extensive corrections but has 

not been “exceptional” com- 

pared with his experiences 

at other journals, Warhurst 
says. The papers “had new 
findings or findings useful in their 
geographical context, but needed 
help with presentation — mainly 
language and analysis.” Ashry Aly, 
director of Ashdin Publishing, says 
that the company is not a preda- 
tory publisher. 

Beall accepts that the publishers 
on his list fall along a spectrum, 
with some being worse than 
others, but he strongly defends 
his methods. He denies that he 
doesn't make sufficient efforts to 
contact publishers, arguing that 
many of them — who often can 
be contacted only through a web 
form — never respond. When it 
comes to publishers in developing 
countries, he says: “Look, when I 
discover a new publisher from 
Nigeria, I admit I am more sus- 
picious than I would be were the 


at particular risk of being unfairly 

tarred by Beall’s brush, critics say. 

Many open-access publishers are springing up in India and China, for 
example, where swelling researcher ranks are creating large publishing 
markets. Pressure to publish is often intense in developing countries, and 
vanity presses could attract unscrupulous researchers keen to pad out their 
CVs. But respectable domestic publishers could have an important role by 
helping to address local science issues, such as those related to crops, 
diseases or environmental problems. 

“Tt is important that criteria for evaluating publishers and journals 
do not discriminate [against] publishers and journals from other 
parts of the world,” says Lars Bjornshauge, managing director of the 
Directory of Open Access Journals (DOAJ), based in Copenhagen, 
which lists open-access journals that have been reviewed for quality. 
New publishing outfits may legitimately use aggressive marketing 
tactics to recruit authors, and they may have yet to polish their websites, 
editorial boards and peer-review procedures. 

“Some are embarrassingly, toe-cringingly amateurish, but predatory 
is a term that, I think, implies intent to deceive,” says Jan Velterop, a 
former science publisher at Nature Publishing Group and elsewhere 
who is now working with several start-ups to promote innovative ways 
to publish science data. Damage could be done if “a damning verdict 
is given to otherwise honest, though perhaps amateurish, attempts to 
enter the publishing market’, he says. 


QUESTIONING THE VERDICT 

For researchers involved in journals whose publishers have appeared 
on Beall’s blacklist, the verdict can be unsettling. David Warhurst, a 
malaria researcher at the London School of Hygiene and Tropical Medi- 
cine, is the unpaid editor-in-chief of Malaria Chemotherapy, Control & 
Elimination, an open-access, peer-reviewed journal owned by Ashdin 
Publishing, a company that has offices in Cairo and Honnelles, Belgium, 
and that is on Bealls list. 

Warhurst says that he was initially reluctant to take up the invitation he 
received two years ago to become the journal's editor-in-chief, because he 
found that the publishers did not have a journal registered with the pub- 
lication index PubMed. But “certainly I do not believe that this is a toxic 
journal’, he says. The journal is still in its launch phase, and refereeing of 


publisher from, for example, the 
Vatican.” But, he says, “I try to be 
as fair and honest as I can be when I am making judgements”. 

Beall says that he usually gives blog posts a ‘cooling-off’ period 
between writing and publishing them. Last month, he announced an 
appeals process in which a three-person advisory board will conduct a 
blinded review and then recommend whether the publisher or journal 
should stay on the list. And to improve transparency, Beall last August 
posted a set of criteria that he says he uses to assess publishers, includ- 
ing an evaluation of their content and practices based on standards 
established by organizations such as the OASPA. Rick Anderson, a 
library dean at the University of Utah in Salt Lake City, says that Beall’s 
criteria “make a lot of sense” and also allow for distinctions between 
truly exploitative publishers and those that are just sloppy. 

Bjornshauge feels that the entire problem needs to be kept in perspec- 
tive. He estimates that questionable publishing probably accounts for 
fewer than 1% of all author-pays, open-access papers — a proportion 
far lower than Beall’s estimate of 5-10%. Instead of relying on blacklists, 
Bjornshauge argues, open-access associations such as the DOAJ and the 
OASPA should adopt more responsibility for policing publishers. He 
says that they should lay out a set of criteria that publishers and journals 
must comply with to win a place on a ‘white list’ indicating that they are 
trustworthy. The DOAJ, he says, is now compiling a list of new, more 
stringent criteria. To help clean up practices, he adds, research funders 
should pay author fees only to such white-listed publishers. Meanwhile, 
he urges researchers to be as cautious when shopping online for publish- 
ers as when shopping for anything else (see ‘Buyer beware’). “Examine 
the company you are about to deal with,” he says. 

Beall says that he supports such efforts. “If someone can figure out a 
better way, that would be great, and I will defer to them,” he says. “I wish 
them success.” But he is sceptical about whether a white list would be 
able to keep up with the surge of new publishers, and believes that his 
blacklist provides more immediate warning. That, however, depends 
on whether he can keep up. “I did not expect and was unprepared for 
the exponential growth of questionable publishers that has occurred in 
the past two years,” he says. = 


Declan Butler is a senior reporter for Nature based in France. 
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Beyond the paper 


The journal and article are being superseded by algorithms that filter, rate and 


disseminate scholarship as it happens, argues Jason Priem. 


enry Oldenburg created the first 
H scientific journal in 1665 with a 
simple goal: apply an emerging 
communication technology — the print- 
ing press — to improve the dissemination of 
scholarly knowledge. The journal was a vast 
improvement over the letter-writing system 
that it eventually replaced. But it had a cost: 
no longer could scientists read everything 
someone sent them; existing information 
filters became swamped. 
To solve this, peer and editorial review 
emerged as a filter, becoming increasingly 
standardized in the science boom after the 


Second World War. This peer-review system 
applies community evaluation of scholarly 
products by proxy: editorial boards, editors 
and peer reviewers are nominated to enact 
representative judgements on behalf of their 
communities. 

Now we are witnessing the transition to 
yet another scholarly communication sys- 
tem — one that will harness the technology 


THE FUTURE OF PUBLISHING 


A Nature special issue. 
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of the Web to vastly improve dissemination. 
What the journal did for a single, formal 
product (the article), the Web is doing for 
the entire breadth of scholarly output. The 
article was an attempt to freeze and mount 
some part of the scholarly process for dis- 
play. The Web opens the workshop windows 
to disseminate scholarship as it happens, 
erasing the artificial distinction between 
process and product. 

Over the next ten years, the view 
through these open windows will inform 
powerful, online filters; these will dis- 
til communities’ impact judgements 
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> algorithmically, replacing the peer- 
review and journal systems. 


THE PRESENT 

This move towards a more diverse set of out- 
puts has already begun. Scholars now share 
their research data in repositories such as 
GenBank, Dryad and figshare (figshare is 
supported by Digital Science, which is owned 
by the same parent company as Nature). They 
use repositories such as GitHub to share code, 
analyses and even ‘executable papers’ that 
automatically knit together preloaded data, 
analysis and prose. They challenge the tradi- 
tional article format by including blog posts, 
interactive graphics and video. And perhaps 
most significantly, academics are moving 
informal scholarly conversations from the 
faculty lounge to social media platforms such 
as Twitter’. In the next ten years, most schol- 
ars will join such networks, driven by both the 
value of improved networking and the fear of 
being left out ofimportant conversations. 

This shift from a paper-native to a 
Web-native system has three significant 
consequences. First, the flow of scholarly 
information is expanding by orders of mag- 
nitude, swamping our paper-based filtering 
system. Like journal articles, Web-native 
products are of inconsistent quality and 
require winnowing; unlike journal articles, 
the scale of these products overwhelms 
attempts at manual curation. In the late 1990s, 
commercial Internet services such as Yahoo! 
found that hiring experts to create vetted lists 
of web pages completely failed at Web scale; 
the same will be true of scholarship. 

Second, the Web era is exposing the deli- 
cate tracework of ideas beneath the formal 
structures of the academy’. The ephemera 
of scholarship are reified in a constellation of 
data points: views on figshare, mentions in 
discussions between colleagues on Twitter, 
saves in a reference manager such as Zotero 
or Mendeley, citations in an open-access 
preprint, recommendations on Faculty of 
1000, and many more. We can use these 
and other tracers of impact to generate new 
metrics of scholarly influence. Informed by 
these alternative metrics, or ‘altmetrics, we 
will draw new maps of scholarly contribu- 
tion, unprecedented in subtlety, texture and 
detail. Suddenly, the rocky plain of ideas — 
once navigated using cairns of citation — is 
covered in fresh snow. In the Web era, schol- 
arship leaves footprints. 

The third consequence proceeds from 
the first two. The editors and reviewers 
employed as proxy community assessors 
will be replaced by the aggregated, collective 
judgements of communities themselves. The 
information-overload problem supplies its 
own solution. This is the power behind the 
Webs great filter, Google. While Yahoo! was 
applying paper-native expert curation to the 
Web, Google was aggregating the collective 
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authority judgements inherent in the 
structure of the network. Its PageRank algo- 
rithm weights hyperlinks from authoritative 
sources more heavily. To find which sources 
count as authoritative, the same algorithm is 
applied to each of the source’s inbound links, 
and so on. This simple recursive algorithm 
has proved remarkably effective, and requires 
minimal manual tuning. It simply harnesses 
the quality judgements already being made 
by the community, implicit in their decisions 
to link to other pages. This core approach is 
also the future for scholarly communication. 


THE FUTURE 

Today’s publication silos will be replaced by 
a set of decentralized, interoperable services 
that are built ona core infrastructure of open 
data and evolving standards — like the Web 
itself (see “Reconstructing publishing’). This 
‘decoupled journal’** publishes promiscu- 
ously, then subjects products to rigorous 
review through the aggregated judgements 
of expert communities, supporting both 
rapid, fine-grained filtering and consistent, 
meaningful evaluation. 

Dissemination. The concept of ‘pre-pub- 
lication’ will be forgotten as frictionless, Web- 
based dissemination increasingly pervades 
the research process. Conversations, data 
collection, analysis and description will be 
born published. Many researchers are already 
practising this ‘open-notebook science’. For 

instance, popula- 


“Tools are tion biologist Carl 
emerging to Boettiger describes 
facilitate this his day’s research and 
‘share early, provides in-progress 
share often’ code, analysis and 


writing, and chemist 
Jean-Claude Bradley's 
lab at Drexel University in Philadelphia, 
Pennsylvania, publishes its entire output 
in near-real time. New tools are emerging 
to facilitate this ‘share early, share often’ 
approach. The journal Push, for example, lets 
scholars build journal articles incrementally, 
with each version tracked and open online, 
available for collaboration and comment 
throughout (see http://push.cwcon.org). 

As the former constituents of the article 
— data, tables, figures, reference lists and 
so on — fracture and dissolve into the fluid 
Web, they will leave behind the core of the 
article: the story. Authors will create these 
stories like blog posts: lean, fast and heavily 
reliant on the published tools, data and anal- 
yses of the entire community. These stories 
will carry forward the synthesis and narra- 
tive functions of the traditional article. Like 
all decoupled journal products, they will 
be openly available, redundantly archived, 
easily attributable and enveloped in an open 
network of links, comments, citations, anno- 
tations, discussions, saves and other interac- 
tions. 


approach.” 
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Certification. Authors will select from 
among diverse paths of certification for their 
research products. For specialized products 
such as daily lab results, no certification is 
needed — use of these products will be lim- 
ited to narrow subfields in which researchers 
are qualified and motivated to consume unfil- 
tered results’. For products aimed at broader 
audiences, authors will rely on algorithmic 
filters to flag their work to others. The Jour- 
nal of Digital Humanities, for example, does 
not take submissions; rather, it highlights the 
best content already published online, often 
pulling from relatively obscure blogs and 
web pages. Importantly, this selection process 
relies heavily on altmetrics (such as number of 
page views, tweets and trackbacks) as a first- 
pass filter before manual curation. 

Similarly, forums such as MathOverflow 
use community-bestowed votes to promote 
or demote content. Many MathOverflow 
posts have been cited in the peer-reviewed 
mathematics literature, and some math- 
ematicians have begun to report high 
MathOverflow reputation scores as evidence 
of scholarly standing. 

Qualitative peer review will move into the 
open and become yet another public schol- 
arly product — complete with its own altmet- 
rics, citations and even other peer reviews. 
The reviewer will metamorphose from gate- 
keeper to interlocutor and collaborator. 

Start-up services such as Rubriq already 
offer ‘standalone peer review, sold to authors 
as a tool to improve their work and help it 
to find its best audience. As peer review is 
unhitched from the creaky machinery of 
paper-native publishing, many more start-ups 
will offer other innovative variations. Some 
may specialize in statistical review or copy 
editing, or emphasize a quick turnaround. 
Many will offer automated review derived 
from altmetrics, while others will leverage 
existing journal brands. 

Authors will select the best services for 
each product and purpose, pinning certi- 
fications to their products’ web pages like 
county-fair ribbons. For instance, an arti- 
cle might receive a ‘B+ overall significance’ 
badge from Nature, a ‘statistically rigorous’ 
badge from a specialist statistical review ser- 
vice and a ‘highly read’ or ‘highly influential 
badge from an altmetrics aggregator. 

Filtering. To channel this torrent of 
Web-native scholarship, readers will rely 
on personalized recommendation engines 
to aggregate both altmetrics and qualita- 
tive badges, producing a bespoke, curated 
stream. Imagine every morning getting an 
e-mail from your personal filtering service 
with the five most important things for you 
to read that day. Content from your narrow- 
est subfield — say, the evolution of toxins in 
Scorpaeniformes (an order of ray-finned 
fish) — is relatively unfiltered, resembling 
a simple RSS feed. At this level of detail, 
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RECONSTRUCTING PUBLISHING 


In future, authors and funders will select services that are best for a given research product (for example, 

A and B). The product will be stored in and disseminated from an institutional repository, and given a unique 
identifier. Most traditional journal functions will be performed independently by an ecosystem of service 
providers. Some examples of services are labelled below. 
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& Product A, such as a data set 


information scarcity is as much a problem 
as information overload, and you want to 
stay aware of nearly everything your closest 
collaborators and competitors do. As you 
move out in scope (to venomous fish, then 
fish, then marine biology, for example), the 
content becomes increasingly filtered: you 
see only products that have been read, dis- 
cussed, saved, cited and recommended by 
the other narrow subdisciplinary commu- 
nities that are (like you) reading everything 
in their specific areas. These recommenda- 
tions constantly improve as the system learns 
from your interactions (as Google search 
does already), but also allow for serendipity 
by delivering a customizable percentage of 
unexpected content from outside your nar- 
row circles. Given such a system, why would 
you ever bother reading another journal? 
Reward. The reward structure of scholar- 
ship will change too. Built on tools such as 
ImpactStory (of which I am a co-founder), 
scholars’ CVs will transform from static lists 
of formal products to real-time displays of 
transparent, normalized metrics that reveal 
diverse impacts. Funders, ever eager to max- 
imize value for their money, will demand 
more Web-native scholarly production. The 
US National Science Foundation, for exam- 
ple, has already begun requesting “products” 
rather than “articles” in the biographical- 
sketch section of grant applications®. The 
widespread availability of open altmetrics 


_. Product B, such as a video 


will fuel this trend, as progressive scholars 
begin to tell — and to be rewarded for — 
compelling, data-driven stories about the 
impact of their diverse products. 

Tenure and hiring committees will adapt, 
too, with growing urgency. Ultimately, sci- 
ence evaluation will become something that 
is done scientifically, exchanging arbitrary, 
biased, personal opinions for meaningful dis- 
tillations of entire communities’ assessments. 
We can start to imagine the academic depart- 
ment asa sports team, full of complementary 
positions (theorists, methodologists, educa- 
tors, public communicators, grant writers 
and so on). Coming years will see evaluators 
playing an academic version of Moneyball 
(the statistical approach to US baseball): 
instead of trying to field teams of identical 
superstars, we will leverage nuanced impact 
data to build teams of specialists who add up 
to more than the sum of their parts. We will 
wonder at how meaningful evaluation could 
have been accomplished in the slow, coarse, 
paper-native world of the journal. 

The peer-reviewed journal is the product 
of some 350 years of selective breeding, a 
thoroughbred exquisitely tuned to wring the 
best possible performance from the paper- 
based world of its creation. But the Web 
has irrevocably changed our information 
environment — it is no longer the habitat 
the journal evolved in. 

The next decade will see a dramatic dieback 
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in journals and a Cambrian explosion of com- 
munication species evolved for the Web- and 
network-centric world. Most of these will be 
failures. We see this already in the dozens of 
‘online journal clubs’ and ‘Facebooks for sci- 
ence; many are virtual ghost towns. But critics 
of these early failed systems miss the point. In 
the early twentieth century, there were more 
than 100 car makers; by mid-century, there 
were barely a dozen. But they were the dozen 
who had got it right. Similarly, we can expect 
the intense selection pressure generated by 
scholars’ hunger for relevant information to 
drive rapid evolution in systems for certifica- 
tion, recommendation and assessment. 


CONCERNS 

The new era of scholarly communication will 
pose challenges. One significant concern is the 
ease of ‘gaming’ the algorithms we will rely on 
for filtering. What is to keep me from recruit- 
ing a few friends or students to tweet, read 
and save my new article, for example? Several 
companies have already emerged to sell social 
media follows and likes. Will I need to spend 
grant money on this sort of payola to compete? 

Of course, any important metric will be 
gamed, just as citations are’. But altmetrics 
are not the easy target they may seem, thanks 
to their diversity and volume. Gaming a 
single metric may be easy; tampering with 
dozens is not. Products with high numbers 
of tweets but no downloads, for instance, 
will stand out. And not just to humans: given 
enough data, machines turn out to be excel- 
lent at finding discrepancies in standard 
usage patterns. Credit-card companies, for 
instance, use ‘algorithmic forensics’ to nab 
identity thieves. The powerful pattern detec- 
tion of constantly evolving algorithms also 
maintains the robustness of Google search 
results and online advertising click counts 
— both of which are heavily gamed by highly 
motivated hackers*. These defences work in 
scholarly contexts, too: the Social Sciences 
Research Network and the Public Library of 
Science have already effectively used algorith- 
mic shields around download counts? (see 
go.nature.com/binjfk). 

A second criticism is that the very idea of 
quantifying scientific impact is misguided. 
This really will not do. We scientists rou- 
tinely search out numerical data to explain 
everything from subatomic physics to the 
appreciation of Mozart; we cannot then 
insist that our cogitations are uniquely 
exempt. The ultimate judge of scientific 
quality is the scientific community; its judge- 
ments are expressed in actions and these 
actions may be measured. The only thing to 
do is to find good measures to replace the 
slow, clumsy and misleading ones we rely on 
today. The great migration of scholarship to 
the Web promises to help us to do this. 

A final concern is that in seeking the wis- 
dom of crowds, we will end up governed 
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by the madness of mobs. Will a crowd- 
sourced scholarship be dominated by pro- 
vocative pap that fills without nourishing? 

Here we must recall first that schol- 
arship has always been a community 
enterprise, driven by building consen- 
sus among experts'”. So the question is 
not ‘Should we crowdsource?’ but ‘How 
should we crowdsource?. Second, we 
must dispose of the straw-man argument 
that hundreds of uninformed readers’ 
opinions will count for more than one 
Fields medallist’s recommendation. 
Authority and expertise are central in the 
Web era as they were in the journal era. 
The difference is that whereas the paper- 
based system used subjective criteria to 
identify authoritative voices, the Web- 
based one assesses authority recursively 
from the entire community. 

We now have a unique opportunity 
as scholars to guide the evolution of our 
tools in directions that honour our val- 
ues and benefit our communities. Here’s 
what to do. First, try new things: publish 
new kinds of products, share them in 
new places and brag about them using 
new metrics. Intellectual playfulness is a 
core scholarly virtue. Second, take risks 
(another scholarly virtue): publishing 
more papers may be safe, but scholars 
who establish early leadership in Web- 
native production will be ahead of the 
curve as these genres become dominant. 
Finally, resist the urge to cling to the trap- 
pings of scientific excellence rather than 
excellence itself. ‘Publication is just one 
mode of making public and one way of 
validating scholarly excellence. It is time 
to embrace the Webs power to dissemi- 
nate and filter scholarship more broadly 
and meaningfully. Welcome to the next 
era of scholarly communication. m 


Jason Priem is an information scientist 
at the University of North Carolina at 
Chapel Hill, North Carolina, USA, and 
co-founder of ImpactStory. 

e-mail: jason@impactstory.org 
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A fool’s errand 


Objections to the Creative Commons attribution 
licence are straw men raised by parties who want open 
access to be as closed as possible, warns John Wilbanks. 


opyright licensing is a topic usually left 

to law review articles, or obscure terms 

of service on websites, or agreements 
between publishers and libraries. But it is an 
essential element of the move towards open 
access — the free, immediate online availabil- 
ity of scholarly articles coupled with the right 
to use them fully in the digital environment. 

An article that is free to read is not neces- 
sarily open for all uses — often, it cannot be 
reused for text mining or in derivative works, 
for example. The permitted uses depend on 
the copyright licence used by the author. 

In my view, for an article to be considered 
truely open access, it has to meet the widely 
accepted definition in the Budapest Open 
Access Initiative — a set of recommenda- 
tions laid out by leaders of the open-access 
movement in 2001. That is, users must be 
able to “read, download, copy, distribute, 
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print, search, or link to the full texts of these 
articles, crawl them for indexing, pass them 
as data to software, or use them for any other 
lawful purpose without financial, legal, or 
technical barriers other than those insepara- 
ble from gaining access to the internet itself. 
The only constraint on reproduction and dis- 
tribution, and the only role for copyright in 
this domain, should be to give authors control 
over the integrity of their work and the right 
to be properly acknowledged and cited? 

Traditional publishing licences tend to 
place restrictions on at least one of these 
uses, and it isi easy for a reader to figure out 
what those are. If the reader is a computer, as 
is more and more prevalent, the restrictions 
are a spanner in the works. 

The use of the Creative Commons 
attribution licence (CC-BY) fulfils the com- 
munity definition of open access and avoids 
a future morass of articles with murky legal 
provenance and concomitant unclear reuse 
possibilities. CC-BY was launched in 2002, 
2 years before I started a 7-year stint as head 
of science initiatives at Creative Commons in 


BRENDAN MONROE 


SOURCE: OPEN ACCESS SCHOLARLY PUBLISHERS ASSOCIATION 


Mountain View, California. 

CC-BY has now come under attack from 
the International Association of Scientific, 
Technical & Medical Publishers, which is 
discussing the introduction of a licence that 
would implement some — but not all — of the 
commonly accepted tenets of open access. At 
a conference run by the association in Janu- 
ary, this was referred to variously as a “new” 
licence and even as “CC Plus”. 

It is a bad idea. Here's why. 


TRIED AND TRUE 

CC-BY isa liberal licence that allows any kind 
of use under copyright as long as the author 
is credited in the manner in which he or she 
specifies. It is more than a decade old, clear, 
well-tested and deeply established as an effec- 
tive open-access licence for both for-profit 
and non-profit publishers (see ‘Licence to 
share). It has been translated into more than 
50 languages and is legally enforceable around 
the world. No other open-access licence can 
claim its power, standing and adoption. 

Critics have lately dubbed CC-BY ‘viral’, 
and bridled against the idea of research coun- 
cils mandating its use as a way to implement 
open access for the scholarly literature. “Viral’ 
can be read either in the cultural sense — an 
article becoming wildly popular — or in the 
legal sense. The former is desirable. The lat- 
ter is false: CC-BY does not force derivative 
works to be relicensed under the same terms. 

Nor is itan unprecedented act for a funder 
to maximize its return on investment by spec- 
ifying that publications arising from its funds 
be published under a liberal copyright licence. 
Taking money for research comes with condi- 
tions: grants from the US National Institutes 
of Health, for example, come with more than 
70 such requirements, including data-shar- 
ing plans, annual reports, ethics training and 
gender indicators. That one of those condi- 
tions be intended to optimize a research 
article’s impact by enabling its reuse by other 
researchers, their robots or by entrepreneurs, 
sits well within the funding tradition. 

Any licence that is less open than CC-BY 
reserves the rights of the copyright holder 
to control certain reuses, and that requires a 
legitimate justification. Ifan article’s publica- 
tion costs have been covered by an article- 
processing fee, then reserving rights is just a 
means of double dipping. 

Furthermore, funders who want the maxi- 
mum impact are going to choose an existing 
standard legal tool that is interoperable with 
the vast majority of free culture and free soft- 
ware licences, not bet on a licence that might 
not be. Licences that distinguish between 
kinds of reuse, or discriminate against entre- 
preneurs, fail every definition of open access, 
open knowledge and open source. Tiny 
details of drafting, intentional or not, often 
render content under one licence legally 
incompatible with content under a different 


one. An essential function of the limited set of 
Creative Commons licences is to forestall the 
hobbling impact that licence proliferation has 
on the network effect of open culture. 

On my cynical days, I fear that this kind of 
hobbling is at the heart of a strategy to create 
‘open-access licences just for scholarly pub- 
lishers. These licences would reserve the most 
creative reuses for those who simply serve as 
the midwives for content, not for those who 
might go on to create works that can surprise, 
inform and delight. If we allow only a tiny set 
of predicted reuses, those are, by definition, 
the only reuses we will get — and they will 
benefit only the existing power players in 
scholarly publishing. 


SPECIOUS CONCERNS 

It is hard to precis all the specious concerns 
about why CC-BY will not work for schol- 
arly publishing. In brief, those opposing 
the licence often claim that it: would allow 
others such as drug companies to sell works 
downstream; implies author endorsement 
of shady overlay journals; would require 
all the elements of an article to be freely 
licensed (including photographs, music, 
modern art and, presumably, Hollywood 
films); and would make attribution on text 
mining unwieldy. 

CC-BY does indeed allow resale — of 
something that is already on the Internet for 
free. Anyone who pays for an object under 
CC-BY is either making a donation, or is pay- 
ing a tax for being inept at searching the Inter- 
net. And a few key elements of CC-BY make 
it possible to prevent dastardly uses. 

First, because attribution is up to the author 
and the journal, it is easy to make sure that any 
copy — if someone is trying to make a shady 
business of reprints, for instance — links back 
to the free version of the article. 

Second, CC-BY does not grant publicity 
rights. That means that attribution can be 
used to clearly disclaim endorsement without 


LICENCE TO SHARE 


Leading open-access scholarly publishers 
have increasingly adopted CC-BY since its 
launch in 2002. 
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the tangles caused by commercial restrictions. 
The attribution can carry a requested form 
of citation, including the URL to the origi- 
nal, peer-reviewed, free and branded copy of 
the article. Then, anyone wishing to reprint 
must also reproduce that citation or be in 
violation of the copyright. It makes it hard to 
imply endorsement or sell something when 
the object itself carries the provenance, and 

links, to the free version in a trusted journal. 
Similarly, publishers can communicate 
their desired attribution in a text-mining 
context. Indeed, because a text miner is only 
extracting ‘facts’ from the text, those facts are 
by law not covered by 


“When in copyright — and thus not 
doubt, use subject to the attribution 
running requirement. Of course, 
code that from a technical and 
someone else scientific perspective, 
has already readers will always want 
written.” to know the provenance 


of a fact, and it is good 
practice to link back to the source. Indeed, the 
idea of link-based provenance is built into the 
design of the Semantic Web. It is a technical 
problem, not a legal one. 

It is also easy to include in an article under a 
CC-BY licence items not subject to the licence 
— images, musical notation and more. That 
simply requires the rider “this article, unless 
otherwise noted, is available under CC-BY”, 
and a note placed by the elements that are not 
available. Thus, a photograph under a Crea- 
tive Commons licence can be used as one of 
Time magazine's Photos of the Year, for exam- 
ple, without rendering the rest of the publica- 
tion subject to the terms of the licence. 

In sum, the debate over CC-BY is actually 
an attempt to reinterpret what the ‘open’ in 
‘open access’ means. There are parties who 
want open access to be as closed as possible 
to protect their business models from change. 

A licence that is designed just for publish- 
ers might feel safer, but it is a fool’s errand. 
CC-BY exists. It is used widely and is driving 
tens of millions of dollars in annual revenue 
for scholarly publishers such as BioMed Cen- 
tral, Hindawi and the Public Library of Sci- 
ence. It fulfils the community definitions of 
openness. And it works with the vast web of 
existing free content. For any new licence to 
achieve all that, it would need to be CC-BY. 

It is encouraging that the open-access 
debate has moved from ‘should we?’ to ‘how 
do we?’ and that we're talking about the issue 
even in this august publication. It would be 
deeply sad if we were to fail now to draw on 
one of the basic lessons of the open move- 
ments that have come before: when in doubt, 
use running code that someone else has 
already written. m 


John Wilbanks is the chief commons officer 
at Sage Bionetworks in Seattle, Washington. 
e-mail: john.wilbanks@sagebase.org 
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How to hasten 


Opel) dCCESS 


Three advocates for a universally free scholarly 
literature give their prescriptions for the movement’s 
next push, from findability to translations. 


ALMASWAN 
Time to align 
policy 


Director of European advocacy, 
Scholarly Publishing and Academic 
Resources Coalition (SPARC) Europe 


Policies for open-access publishing have 
made rapid progress, raising a pressing 
issue — if the research literature is to become 
completely open within the next 5 years, dif- 
ferent policies must be aligned. For research- 
ers who are funded by more than one source, 
this harmonization is crucial in creating a 
single, simple path to making their work 
open access, affordably and sustainably. 

Funders, universities, nations and states 
need to work together now to secure good 
open-access policy. What does that look 
like? It is clear, accurate and focused. It sup- 
ports new models for research communi- 
cation. It does not let copyright become a 
hurdle and does not interfere with authors’ 
choice of journals. It mandates immediate 
access. It does not perpetuate a broken mar- 
ket, but finds ways to curb costs. 

Good policy doesn’t refer to ‘gold’ open 
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access as ‘author pays’ or ‘paid for; because 
66% of open-access journals do not charge. 
It uses the proper definition: journals that 
make their content immediately and freely 
available on the Internet. It doesn't call 
‘green’ open access ‘embargoed access’, 
because 60% of the time it is not. It defines it 
as literature that is made open access directly 
by the author, usually through a repository. It 
doesn’t assume that green open access harms 
publishers, because evidence shows that it 
does not. 

Good policy exploits the fact that gold and 
green routes can both provide articles with 
licences that maximize re-use (see page 440). 
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It introduces measures of compliance — 
and sanctions for non-compliance 
— because these shape behaviour. It 
embraces the fact that open access 
will cause disruptive change. And 
it puts the interests of research, 
and of the public that pays for 
it, before all others. 
Specifically, funders and 
universities must work 
together to align policy with 
respect to permissible embar- 
goes, timing of repository 
deposits for green open access 
and whether research grants can 
be used to buy gold open access. 
They must consider whether payments 
can be made to ‘hybrid journals (subscrip- 
tion journals that make articles open access 
fora fee) and the position that funders take 
on rights, licensing requirements and which 
repositories can be used. 

The European Commission (EC) has been 
explicit about its intentions on these details. 
From January 2014, it will require articles to 
be placed in institutional repositories imme- 
diately upon publication or acceptance. It 
permits only a six-month embargo in disci- 
plines within science, technology, engineer- 
ing and medicine, and a 12-month embargo 
in the humanities and social sciences. The 
policy encourages authors to retain copyright 
and to give publishers licences to publish their 
work, and it allows grant funds to be used for 
publishing charges. The EC has stated its 
hope that European countries will develop 
open-access policies in line with its own. 

State-level policy proposals in the United 
States align with proposed federal policy 
in terms of permitted embargo times (the 
maximum is 12 months). In Belgium and 
Ireland, funder and university policies are 
aligning their focus on deposits in reposito- 
ries, requiring this for research assessment 
and echoing the EC’s maximum permit- 
ted embargo times. In Spain, Norway and 
Denmark, policies that align with that of the 
EC are already in place. The two Austral- 
ian funders’ policies are similar, calling for 
deposit in repositories and differing only 
slightly on permitted embargoes. 

So, harmonization is happening. But the 
development of seamless policies among 
funders needs to be the focus of advocacy. It 
will mean less author confusion and greater 
compliance. Authors will begin to under- 
stand the potential of truly open research 
and be inspired to devise further innovative 
practices. Then we can expect true disrup- 
tion: a very different kind of scholarly com- 
munication, catalysed by good policy. 
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A Nature special issue. 
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MATTHEW COCKERILL 
Make indexing 
fast and fair 


Managing director, BioMed Central 


Imagine a world in which Google demands 
that websites prove their worth for several 
years before their pages become searchable. 
In essence, that is the model that is currently 
in place for scientific publishing. For a jour- 
nal to be indexed within PubMed Central, 
Web of Science and Scopus — in other 
words, for its publications to be easily visible 
to scientists — it must provide a track record 
of high-quality publications. The catch-22 is 
that while proving itself, that journal cannot 
offer authors one of their most important 
requirements: discoverability. 

For open access to take flight, the index- 
ing and visibility of new journals needs sort- 
ing out. PubMed Central's original policy of 
rapid inclusion of open-access journals was 
crucial in helping the model to prove itself. 
The retreat in 2009 from this inclusiveness 
has threatened to stymie evolution in science 
publishing, helping incumbents to defend 
their position and disadvantaging some new 
entrants. 

The controversy generated when the jour- 
nal eLife jumped the queue by appearing in 
PubMed Central before having published a 
single paper, let alone the current usual requi- 
site of at least 15, highlights how baffling entry 
criteria have become. It would be a disservice 
to the community for a high-quality journal 
such as eLife not to be promptly tracked. But 
the jump-start gave the unhealthy impression 
that, to paraphrase George Orwell, ‘all jour- 
nals are treated equally, but some are treated 
more equally than others. 


Conversely, one of the leading journals 
we publish at BioMed Central, Genome 
Medicine, has only just been added to Web 
of Science by Thomson Reuters 4 years after 
its launch. The delay occurred despite a 
strong editorial board, a steady publication 
rate of around 90 articles a year and a pre- 
dicted impact factor of around 6 — greater 
than those for 95% of the journals included 
in Web of Science. 

‘Predatory publishers’ — those offering 
journals of low quality — are undoubtedly a 
concern (see page 433). But endlessly raising 
the walls of the citadel, or making those walls 
scalable only by journals with friends in high 
places, is not the answer: it slows scientific 
progress. 

Instead, indexing services should err on 
the side of including content from any new 
journal that meets basic standards, per- 
haps flagging it with a ‘provisional tag, but 
ensuring that it is discoverable and citable. 
If journals are subsequently found to have 
poor editorial standards or excessively low 
citation rates, they can easily be dropped 
from the index, and prior content removed 
or flagged with a warning. 


DOUGLAS SIPP 
Translate local 
journals 


RIKEN Center for Developmental 
Biology, Kobe, Japan 


Once open-access publishing becomes com- 
mon practice, citizens of English-speaking 
countries will enjoy the unearned advantage 
of being able to read and use research funded 
by other nations. But those nations’ popula- 
tions may be unable to read articles report- 
ing research funded by their own taxes. 
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Governments and public-funding agencies 
in such nations will soon find themselves 
pressed to resolve this inequity. 

In the major east Asian scientific powers of 
China, Japan and South Korea — the world’s 
2nd, 4th and 12th most-productive nations 
in terms of numbers of research publications 
— research and funding institutions have 
established open repositories for published 
research. The National Science Library of 
the Chinese Academy of Sciences, for exam- 
ple, mandated in 2009 that its members file 
articles in its open 


repository withinone “The solution 
month of publication. will be costly 
Atpresent, thisseems and time- 


to be used mainly for 
articles in Chinese- 
language journals. Japan’s largest funder of 
basic scientific research, the Ministry of Edu- 
cation, Culture, Sports, Science and Technol- 
ogy (MEXT), proposed only last year that 
researchers receiving MEXT grants should 
report the access status of their publications. 
It has not specified any access requirements. 

Japan and South Korea have both invested 
in science communication and public-under- 
standing efforts to improve their citizens’ 
access to scientific information, through 
better funded science museums, media and 
online programmes and training of com- 
munication and outreach specialists. For 
example, in 2001 Japan opened a large sci- 
ence museum, the Miraikan, and there are 
now programmes in science communication 
and journalism at Hokkaido and Waseda 
universities. South Korea has increased fund- 
ing for its national science museum, and the 
Korean Academy for Science and Technology 
in Seongnam now offers media training. 

But these activities fall short of unmedi- 
ated access to full sets of data, methods and 
analyses. Local-language journals, another 
partial solution, remain uneven in quality 
and low in impact. English is still undisput- 
edly the common language of the sciences. 

The clearest way to achieve true open 
access in non-English speaking nations 
might be to borrow models from the open- 
access movement itself. A green open-access 
approach to lowering the language barrier 
could see funders call for scientists or insti- 
tutions to produce local language translations 
of any published work conducted using pub- 
lic funds and host them in open repositories. 
A gold open-access approach, by contrast, 
might subsidize translations by journal pub- 
lishers or other third parties. In either case, 
the solution will be costly and time-consum- 
ing, and require countries where English is 
not the native language to work harder than 
their anglophone counterparts to ensure 
public access to one of humanity’s most pre- 
cious resources — its knowledge. m 
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MILITARY HISTORY 


Dinner at the 
Fission Chips 


Mark Peplow assesses a chronicle of the blighted US and 
Soviet communities that fuelled the nuclear arms race. 


CC e had only washcloths, buckets, 
and sometimes rubber gloves,” 
recalls Faina Kuznetsova, a lab 


technician who routinely dealt with major 
leaks at Russia’s first plutonium-manufac- 
turing plant. “We mopped up the spills and 
poured them into big glass bottles. It was 
a very expensive compound and we were 
expected to recover every drop.” 

A multitude of such harrowing accounts 
fills the pages of Plutopia, a ‘hidden history’ 
of two communities — one American, one 
Soviet — that fuelled the nuclear arms race. 
Unusually, historian Kate Brown inter- 
viewed dozens of frontline workers for her 
meticulously researched account of how 
these two remote towns became indelibly 
linked by plutonium, and by catastrophic 
radioactive contamination. 

Brown’s story begins in December 1942, 
when the Manhattan Project to build nuclear 
weapons was in full swing, and the crucial 
ingredients — enriched uranium and plu- 
tonium — were in short supply. The US 
Army selected Hanford in Washington 
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state as the site of the 
world’s first plutonium 
plant: it had plenti- 
ful water and hydro- 
electric power from 
the Columbia River, 
and the site covered 
160 square kilometres 
of bare drylands. 

The plant’s first 
reactor fired up in 
autumn 1944, blasting 
slugs of uranium-238 


Plutopia: Nuclear 
Families, Atomic 
Cities, and the 
Great Soviet 

and American 


with neutrons tocre- Pjytonium 

ate plutonium-239. Disasters 

Less than a year later, «ATE BROWN 
Oxford University 


on 9 August 1945, the 
Hanford plutonium 
found its target via 
‘Fat Man’, the atomic 
bomb that destroyed Nagasaki three days 
after the less-advanced uranium fission 
bomb ‘Little Boy’ detonated over Hiroshima. 

As weapons stockpiles grew, Hanford’s 
growing workforce was housed in Richland, 


Press: 2013. 416 pp. 
$27.95, £17.46 
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one of Brown’s ‘plutopias’: a stage-managed 
American idyll where everyone was white, 
middle class and loved the bomb. Yet Rich- 
land’s chintzy Americana — the Fission 
Chips restaurant or the high school with 
the mushroom-cloud mascot — concealed 
a calculated programme to keep residents 
ignorant of the risks. 

These were considerable. Hanford’s 
operators used the surrounding country- 
side as a vast dumping ground. They rea- 
soned that dissipating the radioisotopes 
in river water, or the atmosphere, was the 
best way to dilute any danger. Radioactive 
waste poured daily into the environment, 
while hundreds of millions of ‘hot particles’ 
of radioactive metal fell like deadly snow 
across Richland. Meanwhile, the town grew 
into a corporate dictatorship, where plant 
managers controlled citizens’ lives in a 
way that would have made a Soviet general 
proud. 

Time and again, official review com- 
mittees denied the evidence. In the early 
years, site medics tested workers after long 
weekends, when most of the radioisotope 
contamination had already been urinated 
from the body. In 1958, the Atomic Energy 
Commission reported to Congress that 
from 1944 to 1958 there had been only one 
radiation-related injury at Hanford, out 
of 18,000 workers. In fact, by 1959, infant 
deaths in Richland were four times the state 
average. 

A pivotal point was the physicist Ernest 
Sternglass’s 1963 paper in Science showing 
a link between maternal exposure to X-rays 
and child mortality from cancer, with 
implications of similar effects for radioac- 
tive fallout. Independent studies followed, 
and by 1966 the radioactivity in Richland- 
ers’ bones had been found to be 50% higher 
than normal. 

Meanwhile, the Soviet leaders had taken 
note, and in 1945 began building their own 
plutopia in the remote Urals. In what Brown 
describes as the “Bronze Age beginnings 
of the Soviet atom’, conscripted labour- 
ers endured terrible conditions to build 
the Mayak plutonium plant and the closed 
city of Ozersk. The first reactor produced 
enough plutonium for a bomb by 1949, 
but many of the fuel slugs, loaded with the 
Soviet Union's entire stock of uranium, had 
cracked. So Mayak’s bosses ordered workers 
to unload and sort 39,000 irradiated slugs 
by hand, protected only by a fortifying glass 
of vodka. 

Thousands of exposed workers routinely 
suffered nausea, nosebleeds and intense 
pain, followed by crumbling bones, can- 
cers and death, Brown reveals. To quell the 
growing unrest, Ozersk was showered with 
cinemas and apartments filled with modern 
conveniences, and then touted as proof that 
Communism delivered prosperity — just as 
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federal dollars were shaping Richland into 
the very model of the American dream. 

Soviet plant managers also copied, and 
surpassed, the Americans’ approach to waste 
disposal. By 1951, 20% of the nearby Techa 
river was radioactive effluent from the plant, 
flowing through dozens of towns in which 
more than 124,000 people lived. Then, in 
1957, an underground waste-storage tank 
exploded, belching forth a mushroom cloud 
that irradiated hundreds of thousands of 
people. 

Ozersk’s residents cleaned up the plant 
with wire brushes and hoses — almost 
certainly receiving severe radiation doses. 
There is no agreement on how many died as 
a result of their exposure. Today, the region 
is a vast radioactive swamp — yet Mayak 
continues to process radioisotopes, and 
managers plan to keep dumping radioac- 
tive waste into open reservoirs until at least 
2018, Brown says. 

Back in Richland, whistle-blowers and 
investigative journalists picking away at the 
town’s anodyne facade forced the US gov- 
ernment to reveal the true scale of Hanford’s 
contamination in 1986. Just weeks later, the 
catastrophic explosion at the Chernobyl 
nuclear power plant in Ukraine occurred. 
In the face of this double whammy, Han- 
ford was slated to close. Together with the 
Mayak plant, it had released much more 
radioactivity into the environment than 
Chernobyl and the site will take half a cen- 
tury to clean up. Yet most Richland resi- 
dents fought hard against Hanford’s critics. 
They loved their town’s social homogeneity, 
orderly management and relative prosper- 
ity, and feared that anti-nuke whiners would 
cost them their apple-pie lifestyle. 

The only underwritten character in Pluto- 
piais plutonium itself. Brown never explains 
why bomb-makers preferred plutonium over 
uranium (it is easier to purify, and requires a 
smaller critical mass), how it was processed 
or how fission works. And although radia- 
tion physics is swamped with confusing and 
archaic units, Brown fails to guide the reader 
through rems, rads, roentgens and curies, 
when a little context could help to make 
sense of the numbers. 

Nevertheless, Plutopia has important 
messages for those managing today’s 
nuclear facilities, arguing for caution and 
transparency. Highly subsidized commu- 
nities are still a feature of civilian nuclear 
programmes in many countries, including 
Japan. It is no coincidence, Brown suggests, 
that the Fukushima nuclear accident in 2011 
was characterized by poor safety protocols, 
official denials and a heavy use of underpaid 
workers to clean up the mess. = 


Mark Peplow is a science journalist based 
in Cambridge, UK. 
e-mail: peplowscience@gmail.com 


Books in brief 


Darwin Deleted: Imagining a World Without Darwin 
Peter J. Bowler UNIV. CHICAGO PRESS 336 pp. $30 (2013) 
Is there anything to add on Darwin and his work? In his 
‘counterfactual’ history, Peter Bowler manages it neatly, 
wondering what might have happened had Darwin not published 
DARWIN DELETED his theory of evolution by natural selection. Bowler argues that at 
x the time, only Darwin had the originality and largeness of vision 
to craft his big idea, but its very boldness polarized thinking. The 
Mime theory, Bowler surmises, would have emerged sans Darwin, but 
y later — which, ironically, might have eased the broad acceptance 
of evolution. 


Butterfly People: An American Encounter With the Beauty of the 
World 

William R. Leach PANTHEON 416 pp. $32.50 (2013) 

Butterflies in their thousands blanketed summer fields in 
nineteenth-century America. These ‘flying jewels’ drew a generation 
of amateur natural historians from the cultural chrysalis. Here, 
historian William Leach celebrates several — including Herman 
Strecker and Samuel Scudder — who created a home-grown 
field. Fed by Linnaeus, Darwin and crowd-sourced photographs 
and specimens, this scientific search for beauty collided early on, 
however, with the country’s commercial drive. 


Picking Up: On the Streets and Behind the Trucks With the 
Sanitation Workers of New York City 

Robin Nagle FARRAR, STRAUS AND GIROUX 304 pp. $28 (2013) 

Daily in New York city, around 9,000 people clear away 11,000 
tonnes of household waste. In her 10-year, sometime-firsthand 
study of ‘san man’ crews, cultural anthropologist Robin Nagle shines 
a light on their invisible lives. She reveals them as agents of urban 
reform and public health; traces the history of sanitation in the city, 
starting with eighteenth-century reformer Cadwallader Colden’s 
yellow-fever control; and evokes the physical and psychological toll 
of this dangerous, filthy, necessary work. 


Robot Futures 

Illah Reza Nourbakhsh THE MIT PRESS 160 pp. $24.95 (2013) 

This glimpse into the future of robotics hums with enthusiasm. In his 
work, roboticist Illah Reza Nourbakhsh has created a raft of objects 
and capabilities, from robot three-dimensional visioning systems to 
a highly propulsive pogo stick. Here, prefacing each chapter with an 
imagined scenario, he forecasts how bots will invade commerce, the 
home and the human body. The possibilities — such as therapeutic, 
injectable robot colonies — are often provocative, but tempered 

by astute insights into the ethical and social implications of a 
roboticized world. 


Narwhals: Arctic Whales in a Melting World 

Todd McLeish UNIV. WASHINGTON PRESS 216 pp. $26.95 (2013) 

The tusked, deep-diving, upside-down-swimming narwhal is a 
cetaceous enigma. Questions hang over its feeding habits, strange 
dental arrangements, population and migration. In this portrait of 
the species, Todd McLeish mixes research, observations from High 
Arctic trips and engaging detours into iceberg ecology and more. 
While the debates rage on — over the tusk as a Sensory organ, for 
instance — many agree that the animal’s adaptive capacity may not 
keep pace with the shrinkage of sea ice. 


28 MARCH 2013 | VOL 495 | NATURE | 445 


© 2013 Macmillan Publishers Limited. All rights reserved 


| COMMENT | BOOKS & ARTS 


GASTROENTEROLOGY 


Down the hatch 


David Katz savours a guided tour of the ins and outs of the gastrointestinal tract. 


singular recollection came to meas I 
A= the adventure of Gulp. I used 

to teach a course called ‘Clinical 
Concepts in Public Health at the Yale School 
of Public Health in New Haven, Connecti- 
cut. This was my opportunity to convey to 
bright young graduate students the marvels 
of the human body in both function and 
dysfunction. It was an epiphany. Compress- 
ing the greatest hits of medical education 
into a single semester helped me to fully 
appreciate how truly astounding it all was. 

My impression is that Mary Roach expe- 
rienced her guided tour of the gastrointes- 
tinal tract in much the same way. There is 
a real sense that her mind boggled at its 
wonders. In her blend of excellent investi- 
gative journalism, history, popular science 
and stand-up comedy, you are in for divert- 
ing surprises: don't expect this book to bea 
straight shot from the northern to the south- 
ern orifice. Each port of call along the way is 
more an excuse for fascinating detours than 
a destination in its own right. 

We learn, for instance, about the work of 
some of the world’s more specialized and 
surprising scientists, past and present. One 
has devoted a career to studying what causes 
variance in the odours of flatulence; another 
researches the secrets of saliva, such as what 
makes it flow. We learn about a scientist who 
pushed a live eel through a fistula — a sur- 
gically created hole — ina dog’s stomach, 
so that only the eel’s head protruded, to 
determine the extent to which anima- 
tion upsets the actions of digestive 
juices. The answer proved to be: not 
at all. The eel was digested; the scien- 
tist, divorced. 

We learn how snakes digest prey 
they have swallowed whole (through 
extremely powerful enzymes) and the 
likely cause of Elvis Presley’s death 
(Hirschsprung’s disease, or toxic 
megacolon). Roach reveals why the 
various worm species that are swal- 
lowed alive by reptiles and amphib- 
ians cannot eat their way back out. 
We are told the possible origins of 
the myth of fire-breathing dragons 
(hint: it has to do with decompos- 
ing mammals in the guts of large 
snakes, in proximity to Stone Age 
campfires). 

In Gulp, Roach imparts her 
myriad insights eloquently, and 
with acerbic wit. A dog’s nose 


“takes in the sights by smell’, and “it takes 
approximately as long to chew narwhalas it 
does to hunt them” She regales us with prime 
examples of nominative determinism: a sci- 
entist named Spitz studies saliva; one called 
Grime researches laundry detergents. 
Adroitly avoiding squeamishness, Roach 
reveals the coda to this story — the anus — 
as a remarkable orifice. She shows how sen- 
sitive it has to be to differentiate solid, liquid 
and gas, and to perform its yeoman’s work 
accordingly. As Robert Rosenbluth, a medic 
she quotes, puts it: “No engineer could 
design something as multifunctional and 
fine-tuned as an anus. To call someone an 
asshole is really bragging him up.” (Person- 
ally, I find that statement smacks of a slight 


excess of enthusiasm, 
and take umbrage on 
behalf of my kidneys 
and liver, to say noth- 
ing of my brain.) 

| The last chapter 
| addresses the gut’s role 
} in autoimmunity, and 
discusses the potential 
therapeutic value of 
‘fecal transplants’ in 


| Mary Roach 


Gulp: Adventures 
on the Alimentary 


Canal treating colitis caused 
MARY ROACH 7 

by the bacterium 
W. W. Norton/ oe aes 
Oneworld: 2013. Clostridium difficile. 
352 pp. $26.95, A promising approach 
£11.99 toa difficult problem, 


Roach tells us, assum- 
ing we can get past the ‘ick factor. 

If there is anything at all unpalatable here, 
it may be the relative paucity of practical 
information. If something ails your innards, 
Gulp is unlikely to help. There is nothing 
remedial, therapeutic or prescriptive on 
the menu here — but Roach has made no 
promises to offer them. Her food for thought 
is proffered purely in the service of enter- 
taining enlightenment. Roach lays out the 
acts of ingestion, digestion and egestion, all 
incisively chewed over — with tongue ever 
in cheek. How she does so without once 
biting that tongue is a testament to her 
authorial gifts. 

Another memory surfaced as I com- 
pleted the journey along this extraordi- 
nary canal. Some 20 years ago, during 
my medical residency, I wrote a skit 
for an annual roast of our attending 
physicians. There were several well- 
loved gastroenterologists among the 
teaching faculty, and I lampooned 
their avid enthusiasm for all things 
gastrointestinal by asserting that 
they were, on the side, producing the 
world’s first science-fiction movie 
filmed entirely through a colono- 
scope: There's Life on Uranus! It’s a 
film Roach might have directed. m 


David Katz is the founding 
director of Yale University 
Prevention Research Center in 
Derby, Connecticut, USA, editor- 
in-chief of the journal Childhood 
Obesity and president-elect of 
the American College of Lifestyle 
Medicine. 
e-mail: david.katz@yale.edu 
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Q&A Robert Darnton 
Knowledge liberator 


Robert Darnton heads the world’ largest collection of academic publications, the Harvard 
University Library system. He is also a driver behind the new Digital Public Library of America. 
Ahead of its launch in April, he talks about Google, science journals and the open-access debate. 


How did you come to run the Harvard 
libraries? 

As a graduate student at Oxford in 1963, I 
began writing about books in revolutionary 
France, helping to found the discipline of 
book history. I was in my academic corner 
writing about Enlightenment ideals when 
the Internet exploded the world of academic 
communication in the 1990s. As president 
of the American Historical Association I 
started a programme to make dissertations 
into e-books in 1999. Before I knew it I was 
involved in other electronic projects. Har- 
vard invited me to become director of the 
libraries in 2007. From that position I have 
been trying to help grow the digital land- 
scape to serve the public good. 


Did you get involved in Google’s effort to 
scan books? 

In 2002 Google began an ambitious project 
to digitize every book in the world. It was 
intended asa search project: type in a query, 
and Google would show you snippets. They 
asked university libraries for books, which 
they would scan for free. At Harvard we didn't 
permit them to take works under copyright, 
but other libraries gave them everything. 


What happened then? 

In 2005 Google was faced with a lawsuit from 
authors and publishers claiming copyright 
infringement. After three years of negotia- 
tions, they came back with a settlement for a 


commercial library. 
Google would split 
the profits with 
copyright owners, 
leaving libraries to 
buy back digital subscriptions to their own 
books. In 2011, a federal court declared the 
settlement unacceptable. Now Google has a 
great digital database of books, which it can 
use in cooperation with publishers to sell 
books, but in my view it’s not creating the 
kind of library that was promised. 


Digital Public Library 
of America launch 
Boston Public Library, 
Boston, Massachusetts. 
18-19 April 2013. 


So you’re launching a public alternative? 

Along with research libraries, archives and 
museums throughout the country, Harvard 
is supporting the non-profit Digital Public 
Library of America (DPLA). At its launch in 
April, the DPLA will make freely available 
online large runs of documents from digi- 
tized special collections. Harvard will con- 
tribute, among many previously inaccessible 
documents, 243 rare medieval manuscripts 
and 3,628 daguerreotypes, including the first 
photographs of the Moon. This initial core 
of offerings will probably grow to include 
about 5 million books in the public domain 
and perhaps 2 million ‘orphar’ books whose 
owners cannot be found. We have also made 
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our data compatible with Europeana, a digi- 
tal library that gathers collections from 27 
countries. 


Will the DPLA contain science? 

We do not currently have plans to digitize 
textbooks or journals. But sometimes the raw 
data are more important: scientists no longer 
leave a paper trail. Digital data are more frag- 
ile than printed material. Eventually we want 
our online library to preserve raw experimen- 
tal data, not just published results. 


Why not science journals? 

In recent years their prices have risen four 
times faster than inflation. An average 
chemistry journal subscription is more 
than US$4,000 annually. An institutional 
subscription to the Journal of Comparative 
Neurology costs more than $29,000 per year. 
Libraries used to spend half their budget on 
journals; now many spend 90% and can't 
afford monographs. 


How did this situation arise? 

Academic publishing grew up in the nine- 
teenth century, when professional associa- 
tions needed to disseminate their members’ 
works. Recently, publishers have taken over, 
creating thousands of journals so specialized 
that there is little competition. Most research 
is funded by taxes, but the taxpayers do not 
have free access to the results of the work that 
they financed. Academics do the research, 
write the papers and referee them — all for 
free — then buy back their work at exorbi- 
tant prices. Publishers can rake in the money, 
because libraries can't unsubscribe without 
alienating scientists. It wasn’t a conspiracy 
or the result of wicked aims. But the whole 
system is fundamentally flawed and we must 
do something about it. 


What is Harvard doing to fix it? 

The Harvard faculty has decided that the 
price increases are unsustainable. We have 
73 libraries and each must limit the haemor- 
rhage of funds. In April 2012, a faculty coun- 
cil asked faculty and students to consider not 
contributing to journals that charge outra- 
geous prices. But voluntary effort alone wont 
solve the problem. We've created a repository 
where all Harvard professors by default must 
deposit their articles, although they can opt 
out, and the traffic from around the world 
is astonishing. The university will pay most 
of the processing fees for articles submitted 
to open-access journals such as those of the 
Public Library of Science. By working with 
other universities, we hope to make most 
journals open-access and eventually to 
change the economics of journal publish- 
ing. Open-access journals get more hits than 
closed ones. This is where the future lies. m 


INTERVIEW BY JASCHA HOFFMAN 


28 MARCH 2013 | VOL 495 | NATURE | 447 


Correspondence 


Online integrity 
training falls short 


Education in the responsible 
conduct of research (RCR) will 
receive a long-overdue critique 
at a conference discussing 

the work of the US Office of 
Research Integrity on 3-5 April 
in Baltimore, Maryland. 

The US National Institutes of 
Health has required recipients of 
training grants to receive RCR 
education since 1990, and it has 
been a prerequisite of the US 
National Science Foundation 
(NSF) since 2010 for all students 
and postdocs funded by its 
research grants. 

In 2012, under contract 
from the National Center for 
Professional and Research Ethics 
at the University of Illinois, 

I reviewed the NSF policies of 27 
major universities. I found that 26 
depend solely (12) or largely (14) 
on online RCR training, with all 
but two using the Collaborative 
Institutional Training Initiative 
(CITI; www.citiprogram.org). 

Outsourcing ethics education 
in this way suggests that 
RCR education is developed 
and executed with an eye 
to expedience rather than 
excellence. It risks sending a 
message to young researchers 
that the university and its 
scientists do not place much 
emphasis on responsible conduct. 

European universities, which 
do not yet have RCR mandates 
in place (see N. Axelsen and 
X. Bosch Nature 489, 208; 

2012), should learn from the 
US experience and develop 
meaningful RCR programmes. 
These need to be taught by the 
people the students want to 
emulate — scientists. 
Kenneth D. Pimple Indiana 
University, Bloomington, USA. 
pimple@indiana.edu 


Too much reliance on 
anonymous tip-offs 
Several scientific journals and 


ethics committees are deferring 
to anonymous judgment when 


it comes to charges of plagiarism 
and falsification of results in 
published research papers. As 

a bioethicist, I believe that this 
practice is risky, even when the 
tipster’s views are valid: it could 
itself damage the integrity of 
scientific research. 

The authors of the blog 
Retraction Watch (www. 
retractionwatch.com) hold the 
view that anomalies detected by 
someone using the pseudonym 
‘Clare Francis are useful to 
scientific journals, irrespective 
of his or her anonymity (see A. 
Marcus and I. Oransky Lab Times 
7, 39; 2011). But this unorthodox 
‘review process pollutes the ethics 
that underpin scientific progress. 

The practice could stimulate 
witch-hunting and pillorying. 
There is a danger that research- 
integrity committees could 
succumb to moralistic drift and 
confuse errors with misdeeds, 
underestimating context anda 
scientist’s professional record. 
Stigmatized researchers might be 
tempted to exact revenge on their 
colleagues. 

In my opinion, this is 
not the way to improve the 
moral standards of science’s 
contribution to society or to build 
public engagement in science. 
Gilberto Corbellini Sapienza 
University of Rome, Italy. 
gilberto.corbellini@uniroma1.it 


Standardize records 
of place of death 


We suggest that a record of 

the place of death should 

be incorporated into death- 
registration data as a useful 
additional health metric (Nature 
494, 281; 2013). 

End-of-life care is a major 
public-health issue, given the 
rising number of deaths from 
chronic illnesses that have 
multiple and complex symptoms. 
Knowing where people die 
can be an indicator of where 
they were cared for, which is 
important for allocating health- 
care resources and for assessing 
related public-health policies. 


Despite surveys that show a 
prevailing preference for home 
death among patients, care- 
givers and the public, most 
deaths in Europe still occur in 
hospital (B. Gomes et al. BMC 
Palliat. Care 12, 7;2013). We 
are supplying Portugal with 
such survey information to help 
improve the recording of place 
of death in its electronic death- 
registration system. 

The place of death is registered 
in a few other countries (the 
United States and Canada, for 
example), but its categorization 
is inconsistent — sometimes 
even within a country. 

Location categories need to be 
internationally standardized 

(J. Cohen et al. BMC Public 
Health 7, 283; 2007) and this 
potentially valuable health-care 
resource put under the political 
spotlight. 

Barbara Gomes* King’ College 
London, UK. 
barbara.gomes@kcl.ac.uk 

*On behalf of 4 co-signatories (see 
go.nature.com/jm4tki for full list). 


Nuclear-waste site 
geology is paramount 


As a former geological adviser to 
the UK government on nuclear- 
waste repositories, I would like 
to clarify some points in your 
discussion of the quest for a 
British nuclear-waste disposal 
site (Nature 494, 5-6; 2013). 
Nirex was a UK government 
agency (not an “independent 
group’ ) that was set up in 1982 
to find a geologically suitable 
site. In 1991, it chose Sellafield 
in Cumbria — one of two 
nuclear industry sites — from a 
list of 537 potentially available 
locations. Neither of these two 
sites was among the geologically 
most suitable, according to 
Nirex’s seven-stage selection 
process. Its 1997 planning 
application for an underground 
laboratory at Longlands Farm, 
near Sellafield, failed because 
the inquiry inspector concluded 
that Nirex did not understand 
the site’s complex geology (see 
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go.nature.com/5p7yae). 

The government’s 2008 
White Paper, Managing 
Radioactive Waste Safely 
(MRWS), put the fact that 
Cumbria volunteered to consider 
housing the waste ahead of 
scientific considerations. This 
contravenes international 
guidelines and practice in 
which national geological 
searches are conducted before 
seeking permission from local 
communities. 

To some, this seemed like a 
back-door attempt to return to 
the Sellafield district, ignoring 
both the inspector’s original 
report and the geological 
problems of the area (see 
go.nature.com/wob9rf). 

You blame a “lack of political 
will” for the failure of the 
Nuclear Decommissioning 
Agency to “sell the facility 
to local residents” On the 
contrary, the now-defunct West 
Cumbria MRWS process spent 
£3.5 million (US$5.3 million) on 
publicity over the past two years. 

So Cumbria County Council 
has demonstrated strong 
political will by listening 
to both the geological and 
the democratic arguments 
against proceeding with a deep 
repository for nuclear waste in 
the region. 

David Smythe University of 
Glasgow, UK. 
dksle@udcf.gla.ac.uk 


CORRECTIONS 

The Outlook article ‘A many 
layered thing’ (Nature 492, 
$52-S54; 2012) contained an 
error in the graphic ‘Caught in 
a loop’. The labels for the Th1 
and Th17 cells were shown 
switched over. 

And in the Outlook article 
‘Mine, all mine!’ (Nature 

495, S2-S3; 2013), the map 
‘Where does gold come from?’ 
originally presented world gold 
production figures in kilograms 
but with the label of tonnes. 
The values have been corrected 
online to show tonnes. 
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Robert Richardson 


(1937-2013) 


Discoverer of superfluidity in helium-3. 


obert Richardson, along with physicist 
Row Lee and myself, discovered that 

helium-3, a rare but stable isotope of 
helium, becomes a superfluid when cooled to 
a minuscule fraction of a degree above abso- 
lute zero. Until that discovery, in the early 
1970s, the superfluidity effect — in which 
aliquid flows without friction — had been 
seen only in helium-4, whose atoms are 
bosons. No one had been able to produce 
the phenomenon with fermions, whose 
nuclei have spin properties different from 
those seen in bosons. 

The discovery astounded the physics 
community, which had all but given up 
trying to produce the phenomenon in 
helium-3. In the decades since, the under- 
standing of superfluidity in this isotope 
has influenced the development of all 
sorts of superconducting materials, with 
applications ranging from power grids 
to nuclear magnetic resonance (NMR) 
instruments. It has also helped astrophys- 
icists to better understand neutron stars, 
which have superfluid cores. 

Richardson, who died on 19 February, 
was born in Washington DC in 1937. He 
grew up in Arlington, Virginia, where 
his father worked for a local telephone 
company. As a boy, Richardson was a 
keen hiker, camper and birdwatcher. 
He was a member of the Boy Scouts of 
America and became an Eagle Scout, the 
highest rank attainable in the programme, 
in the shortest time allowed. 

Richardson received his undergraduate 
and master’s degrees in physics at Virginia 
Polytechnic Institute in Blacksburg, before 
spending six months as an officer in the 
US Army. Before his military service, he 
had intended to earn a master’s in business 
administration, but he was unimpressed 
by the management courses he took in 
the Army and decided to pursue a PhD in 
physics instead. 

In 1960, Richardson began his PhD 
work in Horst Meyer's laboratory at Duke 
University in Durham, North Carolina. 
It was at Duke that Richardson met Betty 
McCarthy, who was also a graduate student 
in physics. The two married in 1962; their 
daughters, Jennifer and Pamela, were born 
in 1965 and 1966. In Meyer’s lab, Richardson 
learned how to make the equipment needed 
to investigate how collections of atoms 
behave at very low temperatures. 

In 1966, soon after Richardson received 
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his PhD, he and his young family moved 
to Ithaca, New York, where he became a 
postdoctoral researcher for David Lee in 
the low-temperature group at Cornell Univer- 
sity. Richardson joined the Cornell faculty in 


Ty: r 


1968 as an assistant professor in the physics 
department, and ultimately became a full 
professor there. I was one of Lee’s graduate 
students when Richardson became a pro- 
fessor. Often working late into the night, 
Richardson was always around and willing 
to give advice, irrespective of whose students 
we were. 

By the early 1970s, chemists and materials 
scientists had for decades been probing the 
behaviour of materials using NMR — in 
which nuclei in a magnetic field absorb 
and re-emit electromagnetic radiation at 
a specific frequency, depending on the 
properties of the atoms and the strength of 
the field. But Richardson played a key part 
in using the technique for extremely low- 
temperature studies. The question he, Lee 
and I set out to answer was whether the direc- 
tion of the nuclear spins in solid helium-3 
could spontaneously order at sufficiently low 
temperatures. 

To produce very low temperatures, 
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we compressed liquid helium-3 until it 
began to solidify, causing the mixture 
of liquid and solid to cool. We initially 
(mistakenly) believed that we had pro- 
duced magnetic order in the solid helium-3, 
at about 0.002 Kelvins. In fact, 
we had observed a new physical state: 
superfluidity in liquid helium-3. The 
three of us shared the 1996 Nobel Prize 
in Physics for this discovery. 

Richardson won or shared many 
awards and honours apart from the Nobel 
prize, including the Sir Francis Simon 
Memorial Prize in 1976 and the Oliver E. 
Buckley Condensed Matter Prize in 1981. 
That year, he was also made a fellow of the 
American Association for the Advance- 
ment of Science. He became a member 
of the US National Academy of Sciences 
in 1986. 

Richardson was an exceptional citizen, 
at Cornell University and in the broader 
physics community. From 1990 to 1996, 
he served as director of the laboratory 
of atomic-state physics at Cornell. From 
1998 to 2007 he was the university’s first 
vice-provost for research. Richardson 
was also a member of the US National 
Science Board, which oversees policy 
issues related to research and education 
in science and engineering. 

Richardson, or Bob to those who knew 
him, was competitive, passionate and 
creative, but also humble and consid- 
erate. The great sadness of his life was the 
tragic death, in 1994, of his younger daugh- 
ter, Pamela, from heart failure. As a way 
of dealing with their grief, Bob and Betty, 
together with Alan Giambattista, a physicist 
at Cornell, threw themselves into writing a 
physics textbook entitled College Physics. 

Bob went out of his way to help and guide 
young scientists. He was an avid participant 
in ‘penny pitching’ — a game, played ritually 
in the low-temperature lab for many years, 
that involves tossing pennies at a wall to 
see whose coin lands closest to it — and he 
often hosted croquet matches at his home. 
He once challenged a graduate student who 
had been training for a marathon to a race up 
several flights of stairs; the student declined, 
wanting to keep Bobs health intact. = 


Douglas D. Osheroff is professor emeritus 
at Stanford University, Stanford, California 
94305, USA. 

e-mail: osheroff@stanford.edu 
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Persuasive evidence on HIV policy 


Antiretroviral therapy has revolutionized the fight against the HIV/AIDS pandemic. Surveillance analyses of a large 
population in rural South Africa make a compelling case that sustained support for this therapy is essential. 


GRACE JOHN-STEWART 


epidemic, one glimmering light is anti- 

retroviral therapy. This treatment involves 
a combination of at least three drugs that sup- 
press HIV, and it prevents progression of the 
infection to AIDS and death. As emergency 
investment to fight HIV/AIDS evolves into 
a long-term response, it remains crucial to 
measure the direct and indirect benefits of HIV 
treatment to sustain investment. Two studies 
published in Science (Tanser et al.' and Bor 
et al.) showa large increase in the overall pop- 
ulation life expectancy and a decrease in new 
cases of HIV infection after expanded use of 
antiretroviral therapy in South Africa (Fig. 1). 

That antiretroviral therapy greatly increases 
the survival of individuals with HIV has been 
known since the 1990s’. Global scale-up of this 
therapy, however, started in 2004, following 
large investments by the US President's Emer- 
gency Plan for AIDS Relief (PEPFAR) and the 
Global AIDS Program (GAP)*. Since then, 
antiretrovirals have been given to more than 
8 million people worldwide (Fig. 2), leading 
to millions of life-years saved — an unprec- 
edented public-health success story’. 

Studies of HIV-discordant couples (in which 
one partner has HIV and the other doesn’t) 
have shown®” that if the HIV-infected partner 
receives antiretroviral therapy, transmission 
rarely occurs. The most compelling confirma- 
tion of this observation was a multi-country 
phase III clinical trial’ (HTV Prevention Tri- 
als Network 052), in which antiretrovirals 
decreased the risk of sexual transmission of 
HIV between couples by 96%. These studies, 
however, involved motivated patients with 
high levels of adherence to treatment. 

The Science papers add a unique perspec- 
tive on the benefits of antiretroviral therapy 
on life expectancy and transmission at a gen- 
eral population level. The studies investigate 
a meticulously characterized population of 
more than 101,000 individuals from house- 
holds in a 434-square-kilometre surveillance 
area in rural KwaZulu-Natal, South Africa. 
Between 2000 and 2011, the households were 
surveyed twice a year for birth and death data 
by the Africa Centre for Health and Popula- 
tion Studies in KwaZulu-Natal. A subset of 


lE the tragic story of the global HIV/AIDS 


Figure 1 | A woman with HIV infection in KwaZulu-Natal, South Africa. 


these individuals was serially tested for HIV 
to estimate the rates of new HIV infections. 
Starting in 2004, Tanser and colleagues’ map- 
ping of individual household location, local 
HIV prevalence and the proportion of HIV- 
infected individuals in specified geographi- 
cal areas who were receiving antiretrovirals 
(antiretroviral-therapy coverage) allowed 
them to discern whether living in an area with 
better coverage was associated with lower risk 
of acquiring HIV infection. 

Both studies demonstrate the profound 
impact of antiretroviral-therapy coverage at a 
population level. Bor and colleagues show that 
expansion of therapy programmes increased 
the overall adult life expectancy by 11.3 years 
in KwaZulu-Natal. Tanser et al. estimate that 
in communities in which 30-40% of those 
infected with HIV were taking antiretrovirals, 
uninfected individuals were 38% less likely to 
acquire HIV than those in communities in 
which the therapy was less widespread. 

Population studies such as these, which 
involve individuals who may have much 
lower drug adherence than those participating 
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in carefully controlled clinical trials, are a 
powerful complement to clinical trials. They 
give us a sense of impact in the ‘real world. 
The reality is that the new studies focus on a 
fairly specific real world — one in which HIV 
prevalence was staggeringly high, allowing 
detection of the impact of therapy on overall 
adult life expectancy. Nonetheless, the stud- 
ies are persuasive in the larger landscape of 
global HIV. Several other countries in south- 
ern and eastern Africa that also have a high 
prevalence of HIV could expect to see similar 
results. In 2012, 25 countries saw a decline 
of more than 50% in new HIV infections’. 
This decrease was due to multiple factors, but 
undoubtedly antiretroviral therapy played a 
pivotal part. 

These data, therefore, should persuade 
policy-makers to sustain or increase invest- 
ment in this form of therapy. Global aid funds 
are constricting or staying stable, whereas 
health needs are persisting or expanding. It 
is clear from the two studies that for regions 
of high HIV prevalence, including South 
Africa, greater investment in antiretroviral 
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Figure 2 | Use of antiretroviral therapy. The 
number of people receiving antiretroviral therapy 
by the end of each year between 2002 and 2011 

in low- and middle-income countries by region. 
(Adapted from ref. 5.) 


therapy is essential. To attain one of the goals 
of the Joint United Nations Program on HIV/ 
AIDS (UNAIDS) — 15 million individuals 
on antiretroviral therapy by 2015 — contin- 
ued support from GAP and PEPFAR will be 
necessary’. 

Scale-up of antiretroviral therapy has led to 
gains in programme efficiencies. Not only have 
drug costs decreased, but also systems to pro- 
vide antiretroviral therapy have diversified and 
become less expensive through recruitment 
of less-highly trained health-care providers 
and introduction of less-intensive laboratory 
monitoring’. Moreover, funding sources for 
provision of antiretroviral therapy are branch- 
ing out, with increased domestic investment by 
the affected countries to complement external 
support’. For instance, South Africa has sub- 
stantially increased its domestic investment in 
antiretroviral therapy, and the present studies 
provide clear justification for sustaining and 
expanding this investment. 

At a national or global policy level, anti- 
retroviral therapy has been the subject of 
scrutiny in many ways: for its dual treatment 
and prevention benefits; for its impact on 
other health needs such as childhood illnesses, 
reproductive health or other infectious 
diseases, including tuberculosis; for the 
potential strain the therapy places on health 
systems, for example whether there is an 
uneven focus on HIV/AIDS at the expense 
of other illnesses in settings with shortages of 
health workers and resources; and for its con- 
tribution to well-being or economic productiv- 
ity. Antiretroviral-therapy programmes have 
led to broad improvements in health systems, 
innovations in clinical education and over- 
hauls of laboratory and clinical infrastructure 
— much more than the global community 
initially envisioned. Financial investments 


were linked to measurable and diverse 
beneficial outcomes. 

For the future, it remains important to 
wrestle with what outcomes are most relevant 
to health-resource investment decisions. 
Although increased overall life expectancy and 
a reduction in new HIV infections at a popu- 
lation level are compelling for policy-makers 
in regions with high HIV prevalence, other 
metrics, including comparisons with other 
health investments and impact on morbidity, 
quality of life or productivity, may be more 
useful in lower-prevalence settings. = 


Grace John-Stewart is in the Department 
of Global Health, University of Washington, 
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Nanoparticle structures 
served up on a tray 


A neat approach that involves laying an array of nanoparticles on a graphene 
sheet supported on an iridium substrate has allowed accurate measurement of 


the nanoparticles’ atomic structure. 


SIMON BILLINGE 


hether we are aware of the fact or 

not, modern life would be pro- 

foundly different without precious- 
metal nanoparticle catalysts’. Every drop of 
petrol used has passed over a platinum cata- 
lyst during production, and the exhaust gases 
of a car’s engine similarly pass over these 
catalysts in the catalytic converter on the way 
out of the car. The catalysts are also essential 
to future sustainable energy technologies, 
such as cars and buses powered by hydrogen 
fuel cells. These processes rely on chemical 
reactions that take place on the catalyst 
surface. But despite our dependence on such 
reactions, we know little about them because 
we lack the tools that can ‘see’ the processes 
involved and measure the interactions between 
reacting molecules and the catalyst. Writing in 
Physical Review Letters, Franz et al.” describe 
an ingenious exploitation of a gift from 
nature that will allow us to make a huge dent 
in this problem. 

A catalyst plays host to a chemical reaction, 
allowing it to proceed without itself being con- 
sumed in the process. Much like a skilled host 
at a party, it brings together like-minded guests 
who then depart, leaving the host to find and 
facilitate other favourable matchings. Under- 
standing the secrets of the successful match- 
maker will help us to find better, cheaper and 
more naturally abundant materials that do the 
same job. But how do we unlock those secrets? 


Catalysis by precious metals takes place on 
the surface of minute nanoparticles, which 
maximize the surface area available for the 
chemical reactions and, equally importantly, 
minimize the amount of expensive precious 
metal needed. 

To understand these catalytic chemical 
processes, we must follow the interactions 
between the reagents and the metal atoms on 
the nanoparticle surface. The surface atoms 
become rearranged to minimize the surface 
energy, a feature called surface reconstruction, 
and the reactivity and interactions between the 
atoms and the reagents vary widely depend- 
ing on this rearrangement. But not only do 
we not know what the surface structure looks 
like, we do not even have robust methods for 
working out the ‘bulk nanoparticle structure, 
a dilemma sometimes referred to as the nano- 
structure problem’. 

When atoms become arranged into a peri- 
odic structure, as if they were on a chessboard, 
they form crystals and we have powerful tools 
for locating them. More recently, with the 
advent of intense synchrotron X-ray sources 
based on particle accelerators, these methods 
have been extended to the study of surface 
atoms. But, crucially, this can be done only 
when the arrangement of the surface struc- 
ture is also periodic. When the arrangement 
exists only on the nanoscale it is invisible, as if 
we are blindfolded. There are two approaches 
to addressing this problem: either develop 
tools for solving structure at the nanoscale’, 
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or arrange the nanoparticles into ordered 
periodic arrays and use the existing tools of 
crystallography”. 

Franz and colleagues’ ingenious approach is 
to serve up the nanoparticles on a tray made of 
graphene (a carbon sheet just one atom thick) 
supported on an atomically flat iridium sur- 
face. The gift from nature is that the delicate 
mismatch between the iridium substrate and 
the graphene tray produces an orderly array of 
ripples in the graphene called a moiré pattern. 
This array is the perfect size in which to nestle 
identical 82-atom nanoparticles of iridium® 
arranged periodically and with the same ori- 
entation (Fig. 1) — exactly the requirements 
for solving a crystallographic structure. For 
the first time, our blindfold is removed and 
accurate quantitative details about the nano- 
particles are revealed, including the nature of 
their surfaces and the modification to their 
structure that results from their interactions 
with the substrate. 

These results would have far-reaching con- 
sequences for the understanding of catalysis if 
the system could be studied when it is under- 
going catalysis and if the changes in structure 
and surface chemistry were followed under dif- 
ferent conditions, such as varying temperature. 
Powerful theoretical approaches for following 
these reactions are also available, but these 
have never been validated against experiment 
because of the lack of results from atomically 
resolved experiments such as this. That is the 
good news. The bad news is that nature gifted 
us with the structure of iridium nanoparticles 
and not with that of the industrially used plati- 
num, which lies tantalizingly one place above 
iridium in the periodic table. 

The authors hope that the magic can be 
made to work with platinum as well, but this 
remains to be seen. Delicate tuning of the 
metal atoms and substrate is required for them 
to form periodically arranged nanoparticles: 
too good a match, and a completely uniform 
monolayer will form; too poor a fit, and they 
will not remain in registry with the moiré 
pattern and will not be periodic and orien- 
tationally aligned. There is still a lot that can 
be learned from this model system of iridium 
particles. However, we can expect a much 
greater impact if the trick can be repeated, not 
just with platinum nanoparticles, but also with 
a range of materials, so that catalytic perfor- 
mance can be correlated with changes in struc- 
tural and electronic properties. 

Even so, commercial catalysts are not made 
in this way, and a final understanding of what 
is happening in a commercial catalytic stack 
will require study of the nanoscale structure 
in situ in a temporally and spatially resolved 
way in a complex, heterogeneous system. 
This is the frontier of complexity that must be 
pushed back if the high-performance materi- 
als needed in many next-generation technolo- 
gies for energy, the environment and health 
are to be developed. It will take a combination 


Figure 1 | Nestling nanoparticles. Franz et al.” have measured the atomic structure of 82-atom 
nanoparticles of iridium (yellow) by arranging them periodically in a graphene-iridium surface (black) 


and applying surface X-ray crystallography. 


of the most powerful emerging experimen- 
tal tools, validated computational methods, 
ingenuity and further gifts from nature. But 
we are close to being able to do this, and steps 
such as those reported by Franz et al. take us 
even closer. = 
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How the ocean exhales 


A record of biogenic opal production in the subtropical Atlantic Ocean fuels the 
theory that ocean circulation, rather than winds, drove the release of carbon 
dioxide from deep marine waters at the end of the last ice age. SEE LETTER P.495 


ELISABETH SIKES 


he concentration of atmospheric carbon 

dioxide was lower during the last ice 

age’ than in the pre-industrial modern 
age, and it is widely accepted that changes to 
deep-ocean circulation during this glacial 
period increased the amount of CO, seques- 
tered in the deep ocean. When deep-ocean 
circulation shifted to modern patterns, the 
sequestered CO, was vented back to the atmos- 
phere, most probably through the Southern 
Ocean. Two plausible controlling mechanisms 
for this degassing have been proposed, one in 
which latitudinal shifts in winds trigger CO, 
release and synchronize post-glacial warming 
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in the Northern and Southern hemispheres’, 
and another in which ocean circulation 
triggers venting and so provides the inter- 
hemispheric link’. On page 495 of this issue, 
Meckler et al.* report evidence of enhanced 
growth of silica-producing (siliceous) 
plankton known as diatoms (Fig. 1) at low 
latitudes at the time when CO, was rapidly 
venting. This suggests that the degassing was 
triggered by transient changes in deep-water 
density driven by fluctuations in northern- 
water masses, rather than bya shift in Southern 
Ocean winds. 

Although the ocean surface is largely in 
equilibrium with the atmosphere, the deep 
ocean holds 50 times more CO, than the 


BEN FRANDSEN 


JOHN CLEGG/SPL 


atmosphere. This is because ofa process called 
the biological pump: first, phytoplankton fix 
CO, in the ocean surface by photosynthesis 
to form organic carbon; then, when they die, 
the phytoplankton sink into the deep ocean, 
where the organic carbon is consumed by 
other organisms and respired back to CO,, 
which forms dissolved carbonate. 

The biological pump also depletes the 
surface waters of nutrients such as nitrogen 
(needed by all phytoplankton) and silica 
(needed by diatoms), transporting them to the 
deep ocean along with the carbon. These nutri- 
ents and dissolved carbonate become trapped 
in the deep ocean by the shallow surface layer, 
which acts somewhat like the lid on a bottle of 
a fizzy drink. Ocean circulation returns water 
containing high levels of carbonate and silica 
to the surface in the Southern Ocean and at 
upwelling regions. When this happens, bio- 
logical productivity at the surface is enhanced 
and — if degassing outpaces the biological 
pump — CO, is released to the atmosphere. A 
record of the elevated productivity is preserved 
when the siliceous remains of the organisms 
settle to the sea floor and accumulate as 
opal sediments. 

During glaciations, factors such as reduced 
formation of deep water in the North Atlantic 
and increased ice cover capping the Southern 
Ocean effectively trapped more CO, in the 
ocean. Early in the most recent deglaciation 
period, iceberg discharge into the North Atlan- 
tic shut down the formation of deep waters in 
this region’. This fundamental change in the 
organization of deep-water masses brought 
warmth to the Southern Ocean and could 
have enhanced upwelling, CO, release and 
southern deep-water formation because of 


density changes alone®. The warming has 
also been suggested to have triggered a shift 
of the westerlies towards Antarctica, form- 
ing part of a feedback mechanism in which 
the shifted winds caused upwelling and more 
CO, to be vented up to the atmosphere, lead- 
ing to yet more warming, wind changes and 
CO, release”. Conversely, evidence from opal 
deposition in the Southern Ocean” suggests 
that the core of the southern westerly winds 
shifted to the south early in the deglaciation 
period and that this may have been the main 
cause of invigorated CO, release. 

Meckler et al. follow the trail of opal depo- 
sition to the subtropical North Atlantic, and 
build on the concept* that increased opal sedi- 
mentation at low latitudes reflects stimulated 
diatom productivity caused by an increase in 
northward transport of dissolved silicate from 
the Southern Ocean. They observed evidence 
of simultaneous rapid opal sedimentation in 
both the subtropical North Atlantic and the 
Southern Ocean during the deglacial shut- 
down of northern deep-water formation, and 
depart from the prevailing interpretation by 
inferring that this corresponded to a slow- 
down in transport of subsurface water from the 
south. The authors correctly point out that the 
combination of stronger winds and enhanced 
diatom productivity around Antarctica would 
have increased the formation of silicate-poor 
water in the south; because this water feeds 
shallow subsurface waters (the thermocline) 
at low latitudes, it would have depressed 
productivity in the tropical North Atlantic. 

The authors instead suggest that reduced 
formation of waters at intermediate depths in 
both hemispheres caused a shallowing of the 
boundary of the silicate-rich southern deep 


Figure 1 | A source of silica. Diatoms are a major group of phytoplankton that encase themselves in a silica 
shell. When these marine organisms die, their shells sink to the ocean floor, forming opal sediments. Meckler 
et al.’ have used opal sediments as a geochemical record of ocean circulation at the end of the last ice age. 


NEWS & VIEWS | RESEARCH | 


waters (see Fig. 4b of the paper’). This would 
have allowed the transfer of silicate and dis- 
solved carbon from the deep waters into the 
photic zone of the ocean — the upper ocean 
region that receives enough light for photo- 
synthesis to occur. Similar to the situation 
in the Southern Ocean today, a competition 
would have ensued in which enhanced pro- 
ductivity in the photic zone pulled CO, back 
down into deep waters, even as more dissolved 
carbon and nutrients emerged from the deep. 
Meckler and colleagues propose that the flow 
of CO, and nutrients outpaced the biological 
pump, and conclude that the observed opal 
sedimentation is diagnostic of deep-ocean 
reorganization and carbon out-gassing dur- 
ing deglaciation, rather than the signature of 
carbon draw-down. 

The prevailing consensus has shifted 
towards winds in the Southern Hemisphere 
as being the driver of CO, release during 
the last deglaciation®”. Meckler et al. put the 
ocean back in control and keep the trigger 
in the Northern Hemisphere. The authors’ 
case hinges, in part, on a re-evaluation of the 
implications for ocean circulation of a range 
of geochemical evidence dating from that cru- 
cial time period. This evidence was previously 
proposed to indicate an increased influx of 
silicate-rich shallow subsurface waters to the 
low-latitude Atlantic. Meckler and colleagues 
attribute the evidence to shallowing of a sili- 
cate-rich deep-water mass (as detailed in the 
Supplementary Information for their paper’). 
Further understanding will come with a refine- 
ment of the timing of key deglacial events””” 
and with a fuller characterization of the shal- 
low water masses that form at high latitudes 
and which feed the thermocline at low lati- 
tudes'’. The authors’ study provides a valuable 
piece of the puzzle for deciphering the ocean's 
role in driving CO, changes during glacial and 
deglacial periods. = 
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X-RAY CRYSTALLOGRAPHY 


One size fits most 


Crystalline ‘sponges’ offer a way to impose order on small molecules so that 
their structures can be solved by X-ray crystallography. This enables nanogram 
quantities of material to be analysed using the technique. SEE ARTICLE P.461 


PIERRE STALLFORTH & JON CLARDY 


uch of science involves describing 
Mite that we cannot see, and no 

method has revealed more about 
the hidden world of molecular structures than 
X-ray crystallography’. But the sample prepa- 
ration required for this technique can be dif- 
ficult, especially when only small quantities of 
a compoundare available. On page 461 of this 
issue, Inokuma et al.’ report an approach that 
can overcome this problem, greatly increasing 
the technique’s applicability. 

X-ray crystallography has been crucial for 
solving the structures of an array of small 
molecules, including antibiotics, industrial 
catalysts, artificial sweeteners and the neuro- 
transmitters that act as signals in the brain. The 
importance of the technique is reflected by the 
effect that a paper on the subject can have on 
journal impact factors. For example, when an 
article’ describing the most widely used set of 
programs for analysing small-molecule X-ray 
data was published in 2008, the impact factor 
of the publishing journal jumped from 2.05 in 
the year of publication, to 49.93 in 2009 and 
54.33 in 2010, sinking back to 2.08 in 2011. 

Given the prominence of the method, it is 
surprising to discover that X-ray crystallogra- 
phy has a major liability: it requires crystals. 


Figure 1 | Order imposed bya framework. These paintings by M. C. Escher 
exemplify features of different crystal structures. a, In most crystals, the 
molecules are tightly packed together, as are the fish in Symmetry Drawing. 

b, The components of crystalline sponges, however, adopt a regular porous 


The regular, repeating arrangement of mol- 
ecules in crystals scatters X-rays into a series 
of amplified and spatially constrained beams 
whose intensity can be easily measured. The 
intensities are computationally processed to 
provide an image of the molecule in the crys- 
tal. Single molecules, however, scatter X-rays 
to produce a weak, continuous beam, which is 
of little use for analytical purposes. 

Of course, other techniques such as mass 
spectrometry and nuclear magnetic resonance 
can be used to obtain molecular structures, but 
they do so through an inferential process of 
ruling out alternatives** that differs funda- 
mentally from the images generated by X-ray 
crystallographic analysis. Nevertheless, these 
alternative techniques can work with sample 
sizes much smaller than those required for 
X-ray crystallography, and, most importantly, 
they do not require crystals®. 

X-ray analysis has benefited from the highly 
intense X-ray beams produced by synchro- 
trons, from the development of efficient ways 
for measuring scattered beams and from power- 
ful algorithms used for the computational pro- 
cessing that provides molecular images. These 
advances directly or indirectly allow smaller 
crystals to be used, but none of them addresses 
the need for crystals. Growing crystals remains 
more of an artisanal skill than an intellectual 
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one, and is based on few general principles and 
lots of trial and error. These difficulties are often 
compounded by the limited quantities available 
for many samples — in general, it is difficult to 
crystallize samples of less than one milligram. 

Inokuma et al. report a way to circumvent 
crystallization of the sample. Their approach is 
breathtakingly simple: they grow host crystals 
that have large, well-defined, interconnected 
cavities, and allow these crystals to soak up 
small molecules from a solution of the sample. If 
the small molecules are the right size and shape 
to be trapped by the host's cavities, they become 
arranged in an orderly fashion that allows them 
to be studied using X-ray crystallography. 

By way of analogy, imagine a departure 
gate at a busy airport, with seats arranged in 
a two-dimensional grid. Passengers for the 
flight that departs from this gate enter the air- 
port through different entrances, mingle with 
passengers for many other flights and explore 
different paths and attractions on their way to 
the gate. But shortly before departure, only the 
passengers on the departing flight occupy the 
seats, or most of the seats, in the lounge. In this 
analogy, the empty, regularly arranged seats are 
the cavities in the host crystals, and the selec- 
tion principle — the appropriate boarding pass 
— is molecular shape. 

For host crystals, Inokuma and colleagues 
used a type of material that has many names: 
metal-organic frameworks, porous materials, 
or, perhaps most appropriately for this appli- 
cation, crystalline sponges’. The structures of 
these sponges are nothing like the illustrations 
of crystals found in introductory texts, which 
depict spheres packed tightly together — simi- 
lar to the elements of tessellation paintings by 
M.C. Escher (Fig. 1a). Instead, they resemble 
the steel scaffolding of skyscrapers under con- 
struction, with the large internal cavities of 


network, such as the one shown in Cubic Space Division. c, Inokuma et al? 
induced a lattice-like arrangement of molecules by trapping them in the pores of 
crystalline sponges, much like the arrangement of fish in Depth. This enabled the 
authors to solve X-ray crystal structures of nanogram quantities of compounds. 
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future rooms outlined by steel girders (Fig. 1b). 
In crystalline sponges, the girder intersections 
are metal atoms, and the girders are long, lin- 
ear, rigid organic molecules. The properties of 
the internal cavities depend on the properties, 
such as the length, of the connecting groups. 
It was known that crystalline sponges could 
be designed to absorb molecules selectively, 
but the authors report variants that have just 
the right degree of promiscuity (the opposite 
of selectivity). The materials are promiscu- 
ous enough to accommodate many different 
shapes of molecules, but selective enough to 
hold trapped molecules in only one orienta- 
tion. The sponges also confine the molecules 
loosely enough to allow reversible binding, so 
that the molecule can try out several differ- 
ent binding orientations until it finds the one 
with the lowest energy. The researchers found 
that their crystalline sponges pulled most of 
the guest molecules they tested out of solution 
and held them in an ordered array (Fig. 1c) so 
that clear structures of the molecules could be 
obtained using X-ray crystallography. 


CELL BIOLOGY 


Inokuma and colleagues used their method 
to solve the structures of an impressively 
diverse set of examples, including a naturally 
occurring molecule, miyakosyne A, that had 
defied full analysis by other methods. One 
of the researchers was also able to determine 
the structures of six molecules in a blind 
test. Impressively, the technique reduces the 
amount of sample needed for analysis by up 
to about 1,000-fold, compared with quantities 
that are typically required for conventional 
X-ray crystallography. 

It remains to be seen what percentage of 
molecules will be amenable to the authors’ 
technique. Moreover, molecules must fit into 
a single cavity in the sponge: miyakosyne A 
seems to be near the upper size limit for which 
the reported sponges will be useful. And for 
the method to be widely adopted, the crys- 
talline sponges developed by Inokuma and 
co-workers must become easily available, for 
example from a commercial supplier. But if 
these initial results are generally applicable, one 
can expect to see an increase both in reported 


Receptor signals 
come in waves 


The activity of G-protein-coupled receptors is not limited to the cell surface. 
Evidence from microscopy points to three temporally, spatially and perhaps 
functionally distinct waves of signalling by these receptors. SEE LETTER P.534 


MARTIN J. LOHSE & DAVIDE CALEBIRO 


anobodies, as their name suggests, 
| \ | are the smallest fully functional anti- 
body fragments known’. They repre- 
sent the end domain of single-protein-chain 
antibodies that occur naturally in camels and 
a few other animals’. Because of their small 
size and their flexibility, which allows access 
to parts of proteins that normal antibodies can- 
not reach, they have become of great interest 
to research into biological receptors. In fact, 
they have been instrumental in stabilizing and 
crystallizing the active forms of G-protein- 
coupled receptors (GPCRs) and their associ- 
ated G proteins’, which are of relevance to the 
treatment of many disorders, including heart 
disease, asthma, allergies and pain. In this issue 
(page 534), Irannejad et al.* use such active- 
state-specific nanobodies to determine where 
and when GPCRs and their G proteins signal 
in cells*. 
The conventional view of GPCR signal- 
ling holds that the receptor couples with its 


*This article and the paper under discussion‘ were 
published online on 20 March 2013. 


G protein only at the cell membrane. Several 
events are then thought to terminate this 
signalling, including binding of the inhibi- 
tor proteins B-arrestins’ to the receptor and 
internalization of the receptor through the 
process of endocytosis in clathrin-coated 
pits, which pinch off the membrane to form 
intracellular vesicles called endosomes. It was 
subsequently discovered® that B-arrestin 
binding to a GPCR may itself trigger a dis- 
tinct wave of ‘non-conventional’ signals 
downstream, such as activation of ERK 
enzymes, which regulate cell growth and 
survival. 

Once inside endosomes, cell-surface recep- 
tors are either degraded or recycled back to the 
cell membrane; they were, however, thought 
to be functionally inactive. Nonetheless, it 
emerged that receptor signalling may also 
occur from endosomes’. Indeed, such inter- 
nal signalling to G proteins has been observed 
for the GPCRs of parathyroid hormone’ and 
thyroid-stimulating hormone (TSH)’, and 
of the signalling molecules sphingosine- 
1-phosphate”® and dopamine”’. This activ- 
ity of internalized receptors is thought to 
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molecular structures and in the preparation of 
alternative crystalline sponges. One can even 
imagine that, in the near future, researchers will 
not bother trying to crystallize new molecules. 
Instead, a small set of crystalline sponges will be 
used to absorb the molecules, and X-ray analy- 
ses performed to see which sponge allows the 
clearest image of the molecule. = 
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constitute a third wave of signalling, follow- 
ing the G-protein-mediated and f-arrestin- 
mediated waves. 

Irannejad et al. now provide photographic 
and video evidence for all three signalling 
waves. They investigated the well-studied 
GPCR known as the B,-adrenergic receptor 
(B,AR) and its associated G, protein. B, AR-G, 
activity has been shown to stimulate produc- 
tion of the intracellular second-messenger 
molecule cyclic AMP in a reaction catalysed 
by the enzyme adenylyl cyclase. This in turn 
induces physiological responses such as 
dilation of blood vessels and bronchi, and 
the tremor that comes with excitement or 
fear. Irannejad and co-workers used two 
nanobodies: Nb80, which binds to active 
6,AR, and Nb37, which recognizes guanine- 
nucleotide-free G, — a transition state in G, 
activation. 

The authors detected two temporally and 
spatially separated waves of B,AR signalling 
through G,. Nb80 was recruited to the active 
receptor not only at the cell membrane but 
also — albeit later on — on the membrane 
of intracellular vesicles. Similarly, active 
G, was detected with Nb37, both at the cell 
membrane and, later on, on intracellular 
vesicles. Both waves of signalling led to 
the accumulation of cAMP. 

Between these two waves, B,AR bound to 
B-arrestin and then clustered in clathrin- 
coated pits, where it is thought to trigger a 
wave of non-conventional signals, including 
ERK activation (Fig. 1). Intriguingly, nei- 
ther Nb80 nor Nb37 bound to the pits, sug- 
gesting that both Nb80 and G, dissociated 
from B,AR at this stage. It could be that the 
B-arrestins actively displace the nanobodies, 
or that only the receptors that are not labelled 
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Clathrin- 
coated pit 
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Figure 1 | One receptor, three signals. Irannejad et al.’ show three waves of signalling by the G-protein- 
coupled f,-adrenergic receptor (B, AR). a, The first wave occurs on the cell membrane and involves 
ligand-receptor interaction with the G, protein, resulting in increased production of cAMP molecules 
by the enzyme adenylyl cyclase (first cAMP wave; see graph). b, The second wave occurs from clathrin- 
coated pits (and possibly their downstream vesicles) and involves receptor interaction with B-arrestin 
proteins, activation of ERK enzymes and other non-conventional signals. c, Wave three occurs after 
receptor internalization to intracellular vesicles — most probably endosomes — and is the second wave 
to involve G,, resulting in further increases in intracellular cAMP (second cAMP wave). The authors 
detected active forms of B, AR and G, with the Nb80 and Nb37 nanobodies, respectively. 


by Nb80 reach the clathrin-coated pits. 

Cells abound with various types of intra- 
cellular vesicle, and it should be determined in 
which of these the second G-protein-mediated 
signalling wave occurs. Work on parathyroid- 
hormone receptors*”’, and now on B,AR, 
points to early endosomes, whereas studies 
of TSH receptors””* suggest that the vesicles 
in which they were signalling belonged to the 
Golgi complex or trans-Golgi network, which 
mediate intracellular trafficking. It could be 
that receptor-containing endosomes eventu- 
ally fuse with other vesicular structures where 
the receptors might meet ‘new G proteins. 

It would be interesting to know whether and 
how ligands that activate GPCRs are internal- 
ized with the receptors to produce the second 
wave of cAMP production. When such ligands 
bind to receptors with high affinity (as in the 
case of TSH and parathyroid hormone), they 
could simply remain bound to the receptors 
during internalization. For low-affinity, rap- 
idly dissociating ligand-receptor pairs, such as 
6, AR and its ligand, this is unlikely. Neverthe- 
less, a certain number of dissociated ligands 
might be internalized in the same vesicle as 
their receptor, and in the tight space within 
the vesicle the two molecules might reunite. 

Strong evidence indicates that GPCRs can 
exist in several active conformations’*"’. Are 
the three waves of 8,AR signalling associated 
with different conformations of this receptor? 
Notably, Nb80 seems to dissociate from B,AR 
before the receptor enters clathrin-coated 
pits, which could be because 8, AR adopts a 


different active conformation when it binds 
to B-arrestin. 

It will be particularly exciting to discover 
whether the two distinct waves of G-protein- 
mediated signalling have different physio- 
logical consequences, and how to specifically 
interfere with each for therapeutic purposes. 
It seems that cAMP generated at the cell 
membrane and at internal sites in the cell may 
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produce different responses. In the case of 
TSH-receptor signalling, for example, the stim- 
ulation of thyroid-hormone secretion has been 
hypothesized to involve cAMP generated inter- 
nally’; this second wave of cAMP signalling, 
and the secretion response, can be inhibited 
by blocking receptor internalization*’. 

Irannejad and colleagues’ results will not 
just entice researchers to dig deeper. They also 
show once more that nature has invented many 
tricks and dodges that allow a few chemical 
messages to transmit a wealth of biological 
information into a cell. = 
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Tubular worms from 
the Burgess Shale 


The discovery, in 500-million- year-old rocks, of fossil acorn worms that lived 
in tubes illuminates the debate about whether the ancestor of vertebrates was a 
mobile worm-like animal or a sessile colony dweller. SEE LETTER P.503 


HENRY GEE 


nderstanding the origin of verte- 

| brates is fascinating not just because 
it appeases our self-interest, but 
because it is inherently difficult. Vertebrates 
seem to have a qualitatively more complex 
construction than other animals. This means 
not only that the origin of distinctive features 
of vertebrate anatomy, such as the head or 
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the neural crest, are difficult to trace among 
invertebrates, but also that the evolution- 
ary roots of vertebrates as a whole are hard 
to fathom. On page 503 of this issue, Caron 
et al.' address a fundamental issue in this 
regard: whether the ancestor of the deuter- 
ostomes (the larger assemblage to which 
vertebrates belong) was a free-living, worm- 
like creature or a sessile, perhaps colonial, 
animal. Their answer, for now, is that it was 


solitary, worm-like and motile’. 

Caron and colleagues describe fossilized 
marine worms found in the Burgess Shale 
of western Canada, which dates to 505 mil- 
lion years ago, in the Middle Cambrian 
period. The fossils — which the authors name 
Spartobranchus tenuis — are enteropneusts, 
also known as acorn worms, and the 
authors show that these particular worms 
are unusual in that they lived at least part 
of their lives in tubes. This dwelling habit is 
otherwise unknown in enteropneusts, but is 
common in their minute, typically colonial 
cousins, the pterobranchs. At first glance, 
the discovery seems modest enough, but 
the finding helps to resolve the sometimes 
vexed question of the relationships between 
enteropneusts and pterobranchs and, 
from that, the probable habit of the earliest 
deuterostomes (Fig. 1). 

The anatomy of enteropneusts has long 
fascinated those questing to understand 
vertebrate origins. The early career of the 
biologist William Bateson was spent study- 
ing these creatures to that end. However, the 
lack of a fossil record, and any sound way to 
reconstruct phylogeny, thwarted his attempts 
— instead, he went off to become the founder 
of genetics’. 


*This article and the paper under discussion? were 
published online on 13 March 2013. 
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What makes enteropneusts especially inter- 
esting is that their midsection is perforated by 
a series of openings, the pharyngeal gill slits. 
These seem to be distinguishing features of 
deuterostomes as a whole: they are found in 
chordates (including some vertebrates) and 
arguably in some primitive echinoderms 
(starfish and their allies). Enteropneusts 
have hints of other features that suggest a 
close alliance with chordates, such as a dor- 
sal, hollow nerve cord and an axial stiffen- 
ing rod called a notochord. These features, 
if they occur at all, are now thought to have 
evolved in parallel in these clades. But the 
occurrence of pharyngeal slits seems to be a 
solid benchmark of common ancestry. 

Molecular work over the past 20 years or so 
has revolutionized our understanding of deu- 
terostome relationships. It is now clear that 
enteropneusts and pterobranchs (which are 
collectively referred to as the hemichordates) 
are more closely related to echinoderms than 
to other deuterostomes’. This discovery revi- 
talized and added weight to a long-held, but 
contentious, suggestion that serial structures 
in some primitive echinoderms might be phar- 
yngeal gill slits homologous with those found 
in chordates*”, 

Another focus of interest is the relationship 
between enteropneusts and pterobranchs. 
There is no doubt that they are closely related, 


Pterobranchs 


VAS 


))) |) Spartobranchus 
g//''''' tenius 


Tube-dwelling 
lost 


Harrimaniid 
enteropneusts 
Other 
enteropneusts 
Echinoderms 
Chordates 


Figure 1 | The habits of our ancestors. This simplified phylogeny shows the evolutionary branching of 
clades within the deuterostomes, and the stages at which pharyngeal gill slits and a tube-dwelling habit 
were gained and lost. Caron and colleagues’ describe enteropneust (acorn worm) fossils that are very 
similar to modern harrimaniid enteropneusts, but that are associated with tubes, an unusual habit for this 
otherwise free-living group. The authors name the fossils Spartobranchus tenuis. Their discovery suggests 
that the ancestor of pterobranchs and S. tenuis adopted a tube-dwelling habit before pterobranchs became 
the sessile, colony-dwelling animals that they are today, and that the ancestor of deuterostomes was 


probably a worm-like, motile animal. 
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but which came first, the generally sessile 
lifestyle of pterobranchs or the free-living 
mode seen in enteropneusts? Because hemi- 
chordates generally seem rather primitive, 
the probable tube- dwelling habit of the deu- 
terostome ancestor hangs on this question. 
Most early researchers backed one idea or the 
other, although zoologist Walter Garstang, 
then seen as a pre-eminent authority, backed 
both. His initial work suggested a motile 
ancestry’*, but later in his career he settled 
on sessility’. However, more recent work has 
suggested something rather startling: that the 
habit of the enteropneust is the more primi- 
tive of the two, and that pterobranchs are most 
closely related to a particular group of entero- 
pneusts — the harrimaniids*. This finding 
meant that pterobranchs could be considered 
as somewhat specialized, sessile versions of 
enteropneusts. 

However, the question remained: how did a 
large, free-living worm evolve into the minus- 
cule tube-dwelling pterobranch that filters 
detritus from sea water using an elaborate 
crown of tentacles. The fossil record preserves 
all kinds of possibly intermediate morpholo- 
gies between one creature and another. But the 
record is notoriously delphic, if not laconic, 
and on this subject it has even less to say than 
usual. Enteropneusts, being soft-bodied, do not 
preserve well as fossils: their record in stone is 
exiguous in the extreme. Before this report, the 
earliest-known fossil enteropneusts were Trias- 
sic, between 250 million years and 200 million 
years old. That Caron et al. extend the fossil 
record of enteropneusts back to the Cambrian 
period makes their findings notable, even with 
no other contribution. 

The horny tubes of pterobranchs, by con- 
trast, preserve rather well. Amateur geolo- 
gists will know them as graptolites. The 
animals that live inside the tubes, however, 
are scarcely represented in the fossil record. 
And this is why the specimens described by 
Caron et al. are so important. They show early 
enteropneusts that are very similar to modern 
harrimaniids, but that are clearly associated 
with pterobranch-like tubes. This suggests 
that the ancestor of pterobranchs adopted a 
tube-living habit before becoming colonial 
and sessile, and before evolving elaborate 
tentacles (Fig. 1). For once, the fossil record 
has spoken in a voice that is more or less 
unambiguous. = 


Henry Gee is a senior editor of Nature. 
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X-ray analysis on the nanogram to 
microgram scale using porous complexes 


Yasuhide Inokuma!, Shota Yoshioka’, Junko Ariyoshi', Tatsuhiko Arai!, Yuki Hitora*, Kentaro Takada’, Shigeki Matsunaga’, 


Kari Rissanen* & Makoto Fujita’ 


X-ray single-crystal diffraction (SCD) analysis has the intrinsic limitation that the target molecules must be obtained as 
single crystals. Here we report a protocol for SCD analysis that does not require the crystallization of the sample. In our 
method, tiny crystals of porous complexes are soaked in a solution of the target, such that the complexes can absorb the 
target molecules. Crystallographic analysis clearly determines the absorbed guest structures along with the host 
frameworks. Because the SCD analysis is carried out on only one tiny crystal of the complex, the required sample 
mass is of the nanogram-microgram order. We demonstrate that as little as about 80 nanograms of a sample is 
enough for the SCD analysis. In combination with high-performance liquid chromatography, our protocol allows the 
direct characterization of multiple fractions, establishing a prototypical means of liquid chromatography SCD analysis. 
Furthermore, we unambiguously determined the structure of a scarce marine natural product using only 5 micrograms 


of the compound. 


Chemists need reliable methods to analyse and determine molecular 
structures. Nuclear magnetic resonance (NMR) and mass spectrometry 
are indispensable tools in daily chemical research for rapidly analysing 
molecular structures, but, strictly speaking, they provide only speculative 
molecular structures that are sometimes assigned incorrectly. However, 
X-ray SCD provides direct structural information at the atomic level and 
is recognized as the most reliable structure determination method’. 
Unfortunately, X-ray SCD has some critical limitations. First, the crys- 
tallization of samples before measurement can not be automated and 
usually requires a time-consuming trial-and-error procedure. Second, 
the method is in principle not applicable to non-crystalline molecules. In 
this Article, we describe an advance in crystallographic analysis based on 
a new X-ray analysis protocol that does not require the crystallization of 
the sample molecules themselves. 

Our idea is to use networked porous metal complexes*” as ‘crystalline 
sponges”. Owing to the high molecular-recognition ability of the pores, 
the crystalline sponges can absorb target sample molecules from their 
solution into the pores, rendering the incoming molecules regularly 
ordered in the crystal. Accordingly, the molecular structure of the 
absorbed guest will be displayed, along with the host framework, by 
the crystallographic analysis of the networked porous complexes. We 
emphasize that even trace amounts of samples (<0.1 jig) can be analysed 
by this method because the experiment can be performed with only one 
tiny crystal (<0.1 mm to a side). In the following discussion, we thus 
describe the crystallographic analysis of non-crystalline compounds 
and nanogram-microgram-scale X-ray crystallography based on our 
method. The great advantage of trace-amount X-ray analysis is particu- 
larly emphasized by its application to liquid chromatography SCD ana- 
lysis (see below), where high-performance liquid chromatography 
(HPLC) fractions are directly collected by the crystalline sponge and 
analysed by X-ray crystallography. Furthermore, we successfully deter- 
mine the structure of a scarce marine natural product, miyakosyne A, 
including the absolute configuration of its chiral centre, which could not 
be determined by conventional chemical and spectroscopic methods. 


X-ray crystallography of liquid samples 

In this study, two networked complexes are used as crystalline sponges: 
{[(Co(NCS)2)3(1)4]*¢x(solvent)},, (2; ref. 8) and {[(ZnI,)3(1)2]*x(solvent)},, 
(3; ref. 9). Both of these can be easily prepared from tris(4-pyridyl)- 
1,3,5-triazine (1) and a metal salt (Co(NCS), and Znl,, respectively). 
Complex 2 has a three-dimensionally networked cage framework, the 
unit structure of which is identical to that of a discrete MeL 4 cage host"® 
(4; M = ethylenediamine-capped Pd(m) ion, L = ligand 1). The frame- 
work of 3 is completely different from that of 2, but the void spaces of 
both 2 and 3 show strong binding properties for incoming guest mole- 
cules, similar to the molecular recognition by discrete cage 4 in solu- 
tion. The as-synthesized complexes 2 and 3 contain solvents in the 
void, which can be exchanged for other guests by soaking the crystals 
of 2 and 3 in a guest solution. The crystallographic analysis of guest- 
exchanged 2 and 3 has been reported*''“, but primarily a large excess 
of solid guests have been examined. Focusing on the crystallographic 
analysis of non-crystalline guests on a small scale, we first developed a 
procedure for one-crystal-scale guest uptake with only a drop of liquid 
guest (Fig. 1a). When a crystal of 2 was brought into contact with a drop 
of cyclohexanone (5) at room temperature (25 °C), rapid guest penetra- 
tion occurred. After 2 d, the crystal was subjected to an X-ray diffraction 
experiment. As expected, the crystal data clearly showed the molecular 
structure of 5 packed within the cavity of the unit cage in a host/guest 
ratio of 1:4 (Fig. 1b). In a similar manner, complex 3 also enclathrated a 
liquid guest, isoprene (6), which is a highly volatile alkene with a melting 
point below —145 °C (Fig. 1c). Once absorbed into a crystal of 3, guest 6 
was no longer volatile and its structure was easily determined by X-ray 
crystallography without any special treatment. 

It is worth mentioning that the absorbed guests are rarely observed 
by X-ray analysis with common porous coordination networks*7'*""*. 
Such networks are mostly characterized by their permanent vacuum 
pores, which rapidly absorb small molecules (normally gases), presum- 
ably in a kinetic way. In contrast, the void spaces of as-synthesized 
crystals of 2 and 3 are filled with highly fluid solvent molecules. Guest 
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Figure 1 | X-ray crystallographic observation of liquid guest molecules 
using crystalline sponges. a, Schematic outline for the preparation of a guest- 
included network complex: a single piece of crystal 2 or 3 was treated for 2d 
with a drop of a liquid guest and subjected to X-ray data collection. The 
experiment with isoprene was carried out in a sealed vial to avoid the 


molecules can slowly penetrate these ‘wet’ cavities by guest exchange, 


and are concentrated at the molecular-recognition pockets surrounded 
by ligand 1. A characteristic of the strong host-guest interaction in the 
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Figure 2 | Nanogram-scale guest inclusion with a crystal of crystalline 


sponge 3. a, Photograph of a microvial including a single crystal of crystalline 


sponge 3 (solvent, cyclohexane) and a CH;Cl,/cyclohexane (1:9) solution of 


Guest solution 


Host crystal 


evaporation of the guest. b, X-ray crystal structure of cyclohexanone 

(5) observed in the cavities of 2. c, Structure of isoprene (6) observed in the 
pores of 3. In the network structures, liquid guests are shown as space-filling 
models. Thermal ellipsoids are drawn at the 50% probability level. 


crystals of 2 and 3 lies in panel ligand 1, which attracts various guests 
onto its electron-deficient m-plane’®”’. The entire process takes place 
under thermodynamic control, similar to host-guest complexation in 


[ec 
100 um 


of the vial highlighted with a red box in the left picture. b, Microgram of the 
single crystal of 3 placed in a microvial. c, Network structure of guaiazulene 
(7) included in crystalline sponge 3 and ORTEP drawing (at the 50% 


guaiazulene 7 (500 ng/50 ull). The schematic illustration represents the bottom _ probability) of the guest 7. (C, grey; H, white). 
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Figure 3 | Crystal structures of a variety of guests determined using a one- 
crystal-scale inclusion protocol. Guest inclusion and X-ray diffraction 
measurement were conducted for each compound with 5 ig of sample and one 


solution. As a result, the geometry of the included guests is well equili- 
brated and regularly ordered, making the crystallographic analysis of 
the accommodated guests possible. The equilibration of guest geo- 
metry, which is an easily overlooked point in rendering the guests in 
order, does not normally occur in kinetic guest capture by the perma- 
nent pores of coordination networks, so the guests are severely disor- 
dered and have rarely been observed by X-ray analysis**”». 


Nanogram-microgram-scale X-ray analysis 
A great advantage of this method is particularly clear when it is 
applied to the structural analysis of trace amounts of samples. 
Given that the experiment is performed with only one tiny crystal, 
the required sample amount is estimated to be of the nanogram- 
microgram order. Because the guest-accessible void of 3 accounts 
for ~50% of the lattice, one tiny crystal (typically 100 1m to a side; 
~1 ug) theoretically requires only ~0.5 pg of organic guest (density, 
~lgcm °), indicating that X-ray crystallographic analysis can be 
performed on nanogram-microgram scales. Furthermore, if only 
one of the recognition sites in 3 firmly traps the guest, it is not neces- 
sary for the rest of the void space to bind the guest. Thus, the average 
occupancy of the guest in the crystal can be much lower than ~ 100%, 
further decreasing the lower limit of the sample amount. 
Encouraged by the theoretical estimation, we examined the one-crys- 
tal-scale guest inclusion using only 500 ng of guaiazulene (7), a natural 
hydrocarbon. A_cube-shaped crystal of 3 (solvent = cyclohexane; 
60 X 60 X 70 jim*) was placed in a microvial with 45 ull of cyclohexane. 
A CH,Cl, solution of guest 7 (500 ng, 5 ul) was added to the supernatant 
and the microvial was capped with a screw cap with a pinhole as an 
exhaust outlet (Fig. 2). The solvent slowly evaporated through the pin- 
hole in an incubator maintained at 45°C over 2d. As the volume of 
supernatant decreased, the crystal of 3, which was initially colourless, 
gradually turned dark blue, indicating the concentration of 7 in the 


Figure 4 | The crystal structure of a chiral guest, santonin, trapped in a 
crystalline sponge. a, Network structure of the clathrate comprising santonin 


network crystal 3. a, Chemical structures of guests. Me, methyl. b, Inclusion 
crystals prepared with 5 ig of sample. c, ORTEP representation of the crystal 
structures of the guests in the pores of 3. 


crystal. The crystal was removed from the vial and directly mounted 
onto an X-ray diffractometer to collect the diffraction data. The crystal 
structure thus obtained showed 7 trapped in the pores with 60% occu- 
pancy (Fig. 2c). Importantly, two methyl and one isopropyl substituent 
on the azulene core were readily apparent from the observed electron 
density map. 

From the occupancy of 7, the total amount of the guest actually 
included in the crystal was estimated to be only 26 ng, which accounts 
for less than 10% of the guest added to the supernatant. This result 
motivated us to reduce the guest amount further. Surprisingly, when 
only 80 ng of guest 7 was used with a crystal of 3 (80 < 80 X 80 um’), 
guest 7 was still clearly observed. Considering that the experiment was 
carried out using a laboratory X-ray machine, we presume that crys- 
tallography is possible with synchrotron X-ray experiments even on a 
mass of <10 ng. 

For practical reasons, a typical protocol for trace-amount crystal- 
lography was established with 5 ig of sample (Methods Summary). 
Six appropriate samples were selected and one of us (S.Y.) performed 
the trace-amount crystallography with only ~5 pg of each sample and 
without any knowledge of the structures. In this blind test, three 
structures were fully determined from only the diffraction data 
(Fig. 3). The other three were initially flawed because of atom mis- 
assignment, symmetry problems and guest disorder, which are com- 
mon problems in crystallographic analysis. However, the incorrect 
structures could be easily corrected using only the mass spectrometric 
data (molecular weight information). Thus, like common crystal- 
lographic analysis, the refined crystal structures should be supported 
by other methods of structural analysis such as mass and NMR 
spectrometry. The successful structural determination of these 
compounds indicates that the scope of guests can be consider- 
ably extended to include polycyclic, non-aromatic and non-planar 
molecules. 


(8) and complex 3. b, ORTEP drawing (50% probability) of the santonin 
trapped in the pore, and chemical structure of santonin. 
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Figure 5 | LC-SCD analysis of natural flavonoids. a, Sketch of LC-SCD 

analysis. Each fraction isolated by analytical HPLC is poured into a microvial 
that contains a single crystal of crystalline sponge. b, HPLC chromatogram of 
PMFs extracted from the peel of Citrus unshiu. a.u., arbitrary units. c—e, X-ray 


Absolute structure determination 


The determination of the absolute configuration of chiral compounds 
is one of the most difficult analyses of molecular structures****. NUR 
and other spectroscopic methods can determine in principle only 
relative stereochemistry. X-ray crystallography is the only method 
that can determine the absolute configuration of chiral molecules, 
on the basis of the anomalous scattering effects of heavy atoms”””°. 
Surprisingly, we were able to determine the absolute structure of a 
trace amount (5 1g) of a chiral compound because the heavy atoms 
(Zn and I) in the host framework (3) showed clear anomalous scatter- 
ing. As-synthesized coordination network3 has an achiral space 
group (C2/c), but when treated with 5 jig of santonin (8), an anthel- 
minthic drug bearing four chiral centres*’, the space group changed to 
chiral P2, owing to the enclathration of the chiral guest in the pores. 
The refined absolute structure was validated by the Flack parameter 
(0.092(18); parenthetical error, s.d.) and the stereochemistry of 8 was 
unambiguously determined (Fig. 4). It is noteworthy that, in contrast 
to common absolute structure determinations’, the chemical 
introduction of heavy atoms on the guest skeleton is unnecessary 
because the host framework contains heavy atoms and shows 
enhanced anomalous scattering effects. We believe that our method 
is the most practical and reliable method for the determination of the 
absolute structure of chiral compounds. 


LC-SCD analysis 


The requisite guest amount in our protocol matches the separation 
scale of analytical HPLC (a few micrograms). This prompted us to 
propose a combination of liquid chromatography and SCD, namely 
LC-SCD analysis. In this analysis, a trace amount of a sample mixture 
is separated by HPLC and each fraction is directly treated with crys- 
talline sponge 3. Subsequently the molecular structure of each fraction 
is analysed by X-ray crystallography. As a proof of concept, we con- 
sidered the structural determination of a series of polymethoxyfla- 
vones (PMFs) that are trace components in orange peel****. A crude 
mixture of PMFs (~30 1g) extracted from a piece of air-dried orange 
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crystal structures of fraction A, nobiletin (5,6,7,8,3’,4’-hexamethoxyflavone) 
(9; c); fraction B (3,5,6,7,8,3',4’-heptamethoxyflavone) (10; d); and fraction C, 
tangeritin (5,6,7,8,4'-pentamethoxyflavone) (11; e) obtained by LC-SCD 
analysis. (C, black; O, red; H, grey). 


peel, Citrus unshiu, was separated by analytical HPLC to give three 
main components, A, B and C (Fig. 5), that weighted 6, 7 and 5 ig, 
respectively. Each fraction was then directly subjected to the guest 
inclusion conditions with one single crystal of 3. The X-ray crystal 
structures of the three samples all showed a flavone ring trapped at a 
similar position in the host pores. On the basis of the X-ray structures, 
components A, B and C were identified as PMFs9, 10, and 11, 
respectively. These structures are in good accordance with their mass 
data (Supplementary Information). It is worth noting that, despite the 
PMFs’ small, simple structures, the determination of their substi- 
tution positions is unexpectedly difficult even with the use of NMR 
spectroscopy. Considering the applicable range of our X-ray protocol, 
the LC-SCD analysis will be a powerful tool for the rapid character- 
ization of multiple components with much higher structural reliabi- 
lity than liquid chromatography mass spectrometry and LC-NMR 
techniques. 


Complete stereochemistry of miyakosyne A 


Using our method, we took on the considerable challenge of deter- 
mining the structure of miyakosyne A, a scarce natural product. 
Miyakosyne A (12), a marine natural product recently isolated from 
a marine sponge Petrosia sp., has three chiral centres on its main alkyl 
chain”’ (Fig. 6a). Although the absolute configurations at C3 and C26 
have been previously determined to be 3R and 26R, respectively, 
conventional spectroscopic and chemical methods are no longer 
effective for the determination of the absolute configuration at C14 
because the difference between the two long alkyl groups on C14 is 
only one methylene unit. X-ray crystallography is expected to be the 
only method that can determine the C14 configuration, but prepara- 
tion of a single crystal of 12 from only a limited amount is an extre- 
mely challenging task. Hence, we applied our method to the full 
characterization of miyakosyne A to determine the absolute config- 
uration at C14. 

A tiny crystal of 3 (approximately 100 X 100 X 100 1m*) was 
treated with 5 ug of miyakosyne A (12) and subjected to a diffraction 
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Figure 6 | Structural determination of miyakosyne A. a, Chemical structure 
of miyakosyne A (12). Numbers indicate carbon atoms; asterisks indicate 

asymmetric carbon atoms. b, Orientation of miyakosyne A enclathrated in the 
pore of crystalline sponge 3. The host framework 3 is drawn as a brown surface, 


study. The crystal structure was successfully solved in the chiral C2 
space group and revealed all the stereochemistry of 12. On the basis of 
the known 3R and 26R configurations, the C14 configuration of 12 
was determined to be S (Fig. 6b, c). Because the occupancy of the 
solved guest 12 was ~50% due to disordered guests and remaining 
solvents in the pores, we carefully checked the validation of the 
crystal structure. We found that the observed electron density map 
(2F, — F.) can be well superimposed on the refined structure of 
(3R,14S,26R)-12, and that when least-squares refinement was per- 
formed with an incorrectly modelled 14R form without any restraints 
around C14, the refinement did not converge to this configuration but 
converged finally to the 14S form. This result is a striking example of 
the structural determination of a scarce natural product and shows 
that our method can be expanded in scope to cover the structural 
elucidation of even large and conformationally flexible compounds. A 
variety of different types of compound can be quickly and readily 
analysed in a similar way. 


Conclusion 


Our method solves the real and intrinsic problems of X-ray crystal- 
lography and transforms it into a rapid and convenient method for the 
analysis of molecular structures using only a trace amount of sample. 
The following features are worthy of special attention. First, the crys- 
tallization step, which is the bottleneck of the X-ray analysis proto- 
col, becomes unnecessary. Therefore, the crystallographic study of 
molecular structures is drastically accelerated and is now applicable 
to the analysis of liquid or even volatile compounds. Second, crystal- 
lographic analysis can be preformed on the nanogram-microgram 
scale. Thus, in terms of sensitivity, X-ray analysis overwhelmingly 
dominates NMR analysis and is even comparable to mass spectro- 
metry. The combination of HPLC and single-crystal X-ray analysis 
(LC-SCD analysis) has been achieved. Third, the determination of the 
absolute configuration of chiral molecules can be easily carried out 
without any chemical modification of the sample molecules. 

The slow diffusion of guests into the solvent-filled voids of the 
crystalline sponges is a particularly important process that renders 
the guests thermodynamically well equilibrated at the specific 
molecular-recognition sites of the hosts. Accordingly, the guests are 
regularly ordered and made observable by crystallographic analysis. 
This is not always the case with common porous coordination net- 
works, in which the permanent pores absorb guests in presumably a 
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and the guest is represented as a ball-and-stick model. (C, green; O, red; H, 
white) c, The absolute structure of miyakosyne A determined by relative 
configuration to C3 and C26. The chiral centre C14 is drawn in magenta. 


kinetic way. Because the crystalline sponges used here are easily 
prepared from commercially available components, the nanogram- 
microgram-scale crystallographic analysis of non-crystalline mole- 
cules is already practicable. We expect that our method will be 
applicable to microanalysis in natural product chemistry, food and 
perfume science, drug discovery, forensic science, and, most impor- 
tantly, the daily research of synthetic chemists**“°. We believe that 
many natural and synthetic compounds that chemists have almost 
given up hope of analysing crystallographically will be easily and 
precisely characterized by this method. 


METHODS SUMMARY 


A cube-shaped crystal of 3 (approximately 100 X 100 X 100 um?) was placed in 
the bottom of a microvial with 45 ll of cyclohexane. A solution of guest com- 
pound (5 1g) in dichloromethane (5 ul) was added to the supernatant, and the 
microvial was capped with a screw cap pierced with a hollow needle (internal 
diameter, 0.8 mm). The solvent of the supernatant was slowly evaporated in an 
incubator maintained at 45 °C over 2 data rate of ~1 ph’. The guest inclusion 
crystal was directly mounted onto a single-crystal X-ray diffractometer with a 
cryoloop and the diffraction data were collected. The crystal structures were 
solved and refined by using the SHELX 97 program. Further details and other 
methods are available in Supplementary Information. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Chemicals. Solvents and reagents were purchased from TCI, WAKO Pure 
Chemical Industries and Sigma-Aldrich, and were used without any further 
purification except where noted. 

Glassware. Microvials for this research were purchased from Waters 
(Deactivated Clear Glass 12 X 32 mm Screw Neck Max Recovery Vial; 1.5 ml). 
A syringe needle, Terumo NN-2116R (internal diameter, 0.80 mm), was used as 
an exhaust outlet for the cap of the microvial. An incubator (Fine FF-12) was used 
to maintain the temperature during guest inclusion. 

Equipment. HPLC separation of microgram-scale natural products was per- 
formed on a Jasco UV-970 spectrometer equipped with a Jasco PU-980 pump 
and an ODS column Develosil ODS-5, (Nomura Chemical Co.). Elemental ana- 
lysis was performed on a CHN/O/S elemental analyser (CE-440, Exeter 
Analytical). Mass spectra were recorded on a Bruker maXis spectrometer. 
Single-crystal X-ray diffraction data were collected on a Bruker APEX-II CCD 
diffractometer equipped with a focusing mirror (MoK, radiation wavelength, 
0.71073 A) and an N> generator (Japan Thermal Engineering Co.) or a Rigaku 
VariMax with RAPID (CuK, radiation wavelength, 1.54187 A). Microscopic 
infra-red spectra were recorded on a Varian DIGILAB Scimitar instrument. 
For X-ray diffraction and microscopic infrared measurements, fluorolube was 
used as a protectant for the single crystals. 

X-ray structural analysis. The structures were solved by direct methods and 
refined by full-matrix least-squares calculations on F* (the squared structure 
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factor), using the SHELX97 program. Residual electron densities in the 
solvent-accessible void due to disordered solvent molecules were treated with 
the PLATON/SQUEEZE program. The quite large (3e-9e A’) residual electron 
density peaks causing checkCIF B level alerts in SQUEEZE are all located near the 
iodine atoms in the corresponding structures and could not be modelled with 
reasonably disordered atoms. 

Plant material. Citrus unshiu was cultivated in Nishi- Uwajima, Ehime prefec- 
ture, Japan in 2011. The peel of C. unshiu was air-dried at room temperature over 
1 week and ground into a fine powder with a lab mixer. 

Inclusion of cyclohexanone (5) into a crystal of 2. On a glass plate, a crystal of 2 
(280 X 280 X 200 tm") was soaked in a drop of cyclohexanone (5) (5 11) at room 
temperature. After standing for 2 d, the crystal was picked using a protectant and 
mounted onto the X-ray diffractometer. After collection of the diffraction data, 
the same crystal was subsequently analysed by microscopic Fourier transform 
infrared spectroscopy. 

Inclusion of isoprene (6) into a crystal of 3. In a microvial, a crystal of 3 
(80 X 50 X 50 um?) was soaked in a drop of isoprene (6) (5 tl). Then the vial 
was sealed with a screw cap and allowed to stand for 2 d at room temperature. The 
crystal was then analysed as described above. 

Nanogram-scale inclusion of guaiazulene (7). To a microvial containing a single 
crystal of 3 (approximately 80 X 80 X 80 um?) and cyclohexane (45 il), we added 
a dichloromethane solution of guaiazulene (7) (80 ng/5 pl or 500 ng/5 pil). The 
crystal was heated at 45 °C for 2d, as described in the general procedure. 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature11922 


Mutations in prion-like domains in 
hnRNPA2BI1 and hnRNPAI cause 
multisystem proteinopathy and ALS 


Hong Joo Kim*, Nam Chul Kim!*, Yong-Dong Wang*, Emily A. Scarborough**, Jennifer Moore!*, Zamia Diaz**, Kyle S. MacLea*, 
Brian Freibaum', Songqing Li', Amandine Molliex', Anderson P. Kanagaraj', Robert Carter®, Kevin B. Boylan®, 

Aleksandra M. Wojtas°, Rosa Rademakers®, Jack L. Pinkus’, Steven A. Greenberg’, John Q. Trojanowski®, Bryan J. Traynor’, 
Bradley N. Smith®, Simon Topp’? Athina-Soragia Gkazi!®, Jack Miller’®, Christopher E. Shaw!°, Michael Kottlors"!, 
Janbernd Kirschner", Alan Pestronk'’, Yun R. Li!’, Alice Flynn Ford’, Aaron D. Gitler'*, Michael Benatar’, Oliver D. King"*, 
Virginia E. Kimonis!’, Eric D. Ross*, Conrad C. Weihl'”, James Shorter? & J. Paul Taylor! 


Algorithms designed to identify canonical yeast prions predict that around 250 human proteins, including several 
RNA-binding proteins associated with neurodegenerative disease, harbour a distinctive prion-like domain (PrLD) 
enriched in uncharged polar amino acids and glycine. PrLDs in RNA-binding proteins are essential for the assembly 
of ribonucleoprotein granules. However, the interplay between human PrLD function and disease is not understood. 
Here we define pathogenic mutations in PrLDs of heterogeneous nuclear ribonucleoproteins (hnRNPs) A2B1 and Al in 
families with inherited degeneration affecting muscle, brain, motor neuron and bone, and in one case of familial 
amyotrophic lateral sclerosis. Wild-type hnRNPA2 (the most abundant isoform of hnRNPA2B1) and hnRNPA1 show 
an intrinsic tendency to assemble into self-seeding fibrils, which is exacerbated by the disease mutations. Indeed, the 
pathogenic mutations strengthen a ‘steric zipper’ motif in the PrLD, which accelerates the formation of self-seeding 
fibrils that cross-seed polymerization of wild-type hnRNP. Notably, the disease mutations promote excess incorporation 
of hnRNPA2 and hnRNPAI into stress granules and drive the formation of cytoplasmic inclusions in animal models that 
recapitulate the human pathology. Thus, dysregulated polymerization caused by a potent mutant steric zipper motif ina 
PrLD can initiate degenerative disease. Related proteins with PrLDs should therefore be considered candidates for 


initiating and perhaps propagating proteinopathies of muscle, brain, motor neuron and bone. 


Elucidating the genetic basis of rare, inherited diseases can provide 
valuable insights to the molecular pathogenesis of common diseases. 
Inclusion body myopathy (IBM) associated with Paget’s disease of 
the bone (PDB), fronto-temporal dementia (FTD) and amyotrophic 
lateral sclerosis (ALS)—sometimes called IBMPFD/ALS—is a rare 
disorder characterized by progressive degeneration of muscle, brain, 
motor neurons and bone accompanied by prominent TDP-43 (also 
called TARDBP) pathology’. Patients with this rare, inherited syn- 
drome experience features of IBM, FTD, ALS or PDB indistinguish- 
able from familial and sporadic cases of these disorders, and the 
disease may manifest in several tissues in the same patient’”. 
Recently the name multisystem proteinopathy (MSP) has been 
adopted to reflect the expanding phenotype and prominent protein- 
aceous pathology that characterizes this syndrome. Some, but not all, 
cases of MSP are caused by mutations in the VCP gene’, which 
encodes the AAA+ ATPase VCP, a ubiquitin-dependent segregase. 


The discovery that VCP mutations cause MSP led to the subsequent 
discovery of pathogenic VCP mutations in more common diseases 
such as sporadic or familial forms of ALS’, FTD*, IBM® and PDB’. 
These rare MSP families represent a unique opportunity to identify 
fundamental molecular defects shared among age-related diseases; 
thus it is highly desirable to identify additional genetic mutations 
responsible for this syndrome. 


A pathogenic mutation in hnRNPA2B1 in MSP 


We identified a family (family 1) with dominantly inherited degen- 
eration affecting muscle, bone, brain and motor neurons that was 
clinically indistinguishable from previous families we have seen with 
VCP-related MSP (Fig. 1a, Supplementary Fig. 1 and Supplementary 
Table 1). Sequencing the entire VCP gene (including introns and exons) 
in affected patients revealed no synonymous or nonsynonymous variants. 
Genetic analysis of this family by exome sequencing and linkage analysis 
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Red = AVFPMILW (small + hydrophobic) 


Figure 1 | Identification of previously unknown disease mutations in MSP 
and ALS. a, Family 1 pedigree indicating individuals affected by dementia, 
myopathy, PDB and ALS. The causative mutation was p.D290V/D302V in 
hnRNPA2B1. Roman numerals denote generation and Arabic numbers denote 
family member within a generation. b, Family 2 pedigree indicating individuals 
affected by myopathy and PDB. The causative mutation was p.D262V/D314V 
in hnRNPAI1.c, The pedigree of a family with ALS. The causative mutation was 
p-D262N/D314N in hnRNPA1. d, e, Sequence alignment of hnaRNPA2B1 

(d) and hnRNPAI (e) orthologues showing evolutionary conservation of the 


in parallel (Supplementary Fig. 1) identified a single, previously unknown 
variant (c.869/905A>T, p.D290V/D302V) that co-segregates with disease 
and alters the gene encoding hnRNPA2B1, a ubiquitously expressed 
RNA-binding protein (Fig. la). hnRNPA2B1 is expressed as two alterna- 
tively spliced isoforms: A2 and B1. The shorter hnRNPA2, which lacks 12 
amino acids in the amino-terminal region, is the main isoform, accounting 
for ~90% of the protein in most tissues. The identified mutation substi- 
tutes a valine residue in the place of an evolutionarily conserved aspartate 
residue (Fig. 1d) that is centred in a motif conserved in multiple human 
paralogues of the hnRNP A/B family (Fig. 1f and Supplementary Fig. 3). 


Pathogenic mutations in hnRNPAI in MSP and ALS 


Additional validation of the pathogenicity of the hnRNPA2B1 mutation 
came from the analysis of family 2. The clinical features of this family with 
VCP-negative MSP have been previously described’. To identify the 
culprit mutation we took the same strategy as for family 1 (Supplemen- 
tary Fig. 2 and Supplementary Table 2). This analysis identified five 
previously unknown single nucleotide variants and one indel that co- 
segregated with the disease. Of these, a mutation in hnRNPAI (c.785/ 
941A>T, p.D262V/D314V) stood out because it was the only variant 
associated with a conserved residue, it was predicted to be deleterious, 
and because hnRNPA1 is highly expressed in affected tissues (Fig. 1b and 
Supplementary Fig. 2). Moreover, this mutation in hnRNPA1 was identi- 
cal to that found in hnRNPA2B1 in family 1, creating a substitution of a 
valine residue in the place of a highly conserved aspartate residue that is 
centred in a motif conserved in multiple human paralogues of the hnRNP 
A/B family (Fig. le, f and Supplementary Fig. 3). Subsequent to the 
identification of these MSP-causing mutations we screened the exomes 
of 212 familial ALS cases for sequence variants affecting hnRNPA2B1 or 
hnRNPAL. In one dominantly inherited case in which known ALS genes 
were formally excluded, we identified a mutation (c.784/940G>A; 
p-D262N/D314N) affecting the identical, conserved aspartate residue 
in hnRNPA1 (Fig. lc, e, f). 

The identification of disease mutations in hnRNPA2B1 and 
hnRNPAI was intriguing for three reasons. First, hnaRNPA2B1 and 
hnRNPAL interact directly with TDP-43 and function cooperatively 
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mutated aspartate and surrounding residues. Gaps (black lines) are inserted 
between residues so that identical or similar amino acids are aligned in 
successive columns. f, Sequence alignment of four human paralogues of the 
hnRNP A/B family in which the disease-affected residue and surrounding 
residues are highly conserved. Acidic residues (Asp, Glu) are in blue, basic 
residues (Arg, His, Lys) are in magenta, uncharged polar amino acids (Ser, Thr, 
Tyr, Asn, Gln) and Gly and Cys are in green, and nonpolar amino acids aside 
from Gly and Cys (Ala, Val, Phe, Pro, Met, Ile, Leu, Trp) are in red. 


to regulate RNA metabolism®. Second, an unbiased genetic screen 
previously identified fly homologues of TDP-43, hnRNPA2B1 and 
hnRNPAI as suppressors of VCP-related degeneration in a Dro- 
sophila model of MSP? (Supplementary Fig. 3). Last, hnaRNPA2B1 
has been previously implicated in neurodegenerative disease. 
Specifically, hnRNPA2B1 is sequestered in RNA foci in the fragile- 
X-associated tremor ataxia syndrome (FXTAS)”°, binds the expanded 
riboCGG repeats that underlie this disease’*"’, and is a genetic modi- 
fier in a Drosophila model of FXTAS’*”. 


hnRNPA2BI and hnRNPAI pathology in MSP 


Muscle biopsies from patients II5 (family 1) and IV9 (family 2) 
showed atrophic fibres, central nuclei and rimmed vacuoles char- 
acteristic of IBM (Supplementary Fig. 4a—c). Whereas in normal mus- 
cle hnRNPA2B1 and hnRNPA1J are exclusively nuclear (Fig. 2a, e and 
Supplementary Fig. 4d), analysis of muscle tissue from patient II5 
showed that hnhRNPA2B1 cleared from many nuclei and accumulated 
in cytoplasmic inclusions in ~10% of fibres (Fig. 2b and Supplemen- 
tary Fig. 4e). Muscle from this patient also showed TDP-43 pathology 
consisting of nuclear clearance and cytoplasmic inclusions, consistent 
with previous observations in VCP-related and sporadic IBM (Fig. 2j 
and Supplementary Fig. 4p)’. Interestingly, hnRNPA2B1 pathology 
was also observed in VCP-related and sporadic IBM (Fig. 2c, d and 
Supplementary Fig. 4h, i). 

Analysis of muscle from patient IV9 also revealed nuclear clearance 
and cytoplasmic inclusions of hnhRNPA1I in ~10% of fibres (Fig. 2f 
and Supplementary Fig. 4j). hnRNPA2B1 and TDP-43 pathology 
were also observed in the muscle biopsy from patient IV9 (Fig. 2h, 
k and Supplementary Fig. 4f, g). Moreover, hnRNPA1 pathology was 
also observed in VCP-related and sporadic IBM (Fig. 2g and data 
not shown). Therefore, regardless of aetiology we have found 
hnRNPA2B1, hnRNPAI and TDP-43 pathology in sporadic and 
familial IBM. Pathological redistribution of hnaRNPA2B1, hnRNPA1 
and TDP-43 was not observed in other types of muscle disease (Sup- 
plementary Fig. 4q). FUS (also known as TLS)-associated pathology 
was not observed in these cases, and thus pathological redistribution 
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Figure 2 | Cytoplasmic pathology of hnRNPA2B1 and hnRNPAI. 

a-d, Immunohistochemical analysis of hnRNPA2B1 (red) in normal muscle 
(control, a), muscle biopsy from patient II5 from family (F) 1 (b), a patient with 
MSP caused by a VCP mutation (R155H) (c) and a patient with sporadic IBM 
(sIBM, d). hnRNPA2B]1 (red) was cleared from 4’,6-diamidino-2-phenylindole 
(DAPI)-stained nuclei (blue) and accumulated in cytoplasmic inclusions (b- 
d). e~g, Immunohistochemical analysis of hnRNPA1I (red) in a normal muscle 
(e), a muscle biopsy from patient IV9 from family 2 (f) and a patient with MSP 
caused by a VCP mutation (R155H) (g). hnRNPA1 was cleared from nuclei and 
accumulated in cytoplasmic inclusions (f, g). h, In muscle tissue from patient 
IV9 (family 2), hnRNPA2B1 was cleared from nuclei and accumulated in 
cytoplasmic inclusions. (i-1) Immunohistochemical analysis of TDP-43 (red) 
in anormal muscle (i), muscle biopsy from patient IIS from family 1 (j), muscle 
biopsy from patient IV9 from family 2 (k) and a patient with MSP caused by a 
VCP mutation (R155H) (1). TDP-43 (red) was cleared from DAPI-stained 
nuclei (blue) and accumulated in cytoplasmic inclusions (j- 

1). m-p, Immunohistochemical analysis of hnRNPA2B1 and TDP-43 co- 
localization in a patient with sporadic IBM. DAPI (m), TDP-43 

(n), hnRNPA2B1 (0) and merged images (p) are shown. hnRNPA2B1 (red) 
and TDP-43 (green) were cleared from nuclei (arrowheads) in an atrophic 
muscle fibre, but found in nuclei of neighbouring unaffected fibres (arrows). 
Scale bars, 50 um. 


does not affect all hnRNPs. Double staining revealed that fibres with 
TDP-43 pathology typically also showed hnhRNPA2B1 and hnRNPA1 
pathology (Fig. 2m—p and Supplementary Fig. 4g, k). In these instances 
there was often partial co-localization of TDP-43, hnRNPA2B1 and 
hnRNPAI. We also observed limited immunopositivity for ubiquitin 
and p62 (Supplementary Fig. 41-o). 


Disease mutations affect hnRNPA2B1 and Al PrLDs 


hnRNPA2B1 and hnRNPAI each have carboxy-terminal, glycine-rich 
domains that are essential for activity and mediate interaction with 
TDP-43 (ref. 8). FoldIndex predicts these domains to be intrinsically 
unfolded (Fig. 3a, b). Interestingly, these low-complexity domains have 
an amino-acid composition similar to that of yeast prion domains. 
Indeed, a previously devised algorithm” predicted that the C-terminal 
regions of hnRNPA2 (residues 185-341) and hnRNPAI (residues 186- 
320) are prion like (Fig. 3a—d), whereas these domains narrowly missed 
the cutoff for prion propensity according to a different algorithm’”’. 
Interestingly, similar PrLDs are present in many hnRNPs, including 
TDP-43 and FUS (Supplementary Fig. 5 and Supplementary Table 3). 
The disease-causing mutations fall at the centre of the predicted PrLD, 
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and, according to both algorithms, are expected to enhance prion-like 
behaviour (Fig. 3a, b). 

We also examined hnRNPA2B1 and hnRNPAI with ZipperDB, a 
structure-based threading algorithm, which scores six-amino-acid 
segments or their propensity to form two self-complementary beta 
strands termed ‘steric zippers’ that form the spine of amyloid fibrils'®. 
Hexapeptides with Rosetta energy lower than —23 kcal mol ' are pre- 
dicted to form steric zippers, with lower energy predicting higher amy- 
loidogenicity. ZipperDB predicted that disease mutations increase the 
potency of steric zipper motifs present in the PrLDs of hnRNPA2B1 
and hnRNPAI (Fig. 3c-f). This analysis is discussed in greater detail 
in Supplementary Fig. 6. Taken together, multiple algorithms predict 
that the PrLDs of hnaRNPA2B1 and hnRNPAI are intrinsically dis- 
ordered, but poised to access higher order self-templating structures. 
Notably, the disease mutations are centred within these PrLDs and are 
predicted to promote transition to an ordered structure. 


Disease mutations accelerate hnRNP fibrillization 


We directly tested the predictions that (1) hnRNPA2 and hnRNPA1 
are prone to fibrillization and (2) that this property is enhanced by 
disease-causing mutations. First, we experimentally assessed the 
ZipperDB prediction for the impact of disease mutations on steric 
zipper motifs found in hnRNPA2B1 and hnRNPAI. Notably, the 
synthetic mutant hexapeptides of hnRNPA2B1(D290V) (Asn-Tyr- 
Asn-Val-Phe-Gly) and hnRNPA1(D262V) (Ser-Tyr-Asn-Val-Phe- 
Gly) rapidly assembled into amyloid fibrils, as shown by thioflavin 
T fluorescence and electron microscopy, whereas the corresponding 
wild-type peptides did not, even after several weeks (Fig. 4a—d). Thus, 
the disease-causing mutations in hnRNPA2B1 and hnRNPAI gene- 
rate highly amyloidogenic hexapeptides precisely as predicted. The 
more potent steric zippers that result from disease mutations are likely 
to be important for two reasons. First, introduction of similarly potent 
steric zippers is sufficient to force fibril formation, even in model 
proteins that do not ordinarily fibrillize’’. Second, in hnhRNPA1 and 
hnRNPA2 these potent steric zippers are centred in the intrinsically 
disordered PrLD and are thus available to make intermolecular con- 
tacts and drive fibril formation. 

We next assessed the fibrillization propensity of full-length pro- 
teins. We purified bacterially expressed, glutathione S-transferase- 
tagged wild-type and mutant hnRNPA2 and hnRNPAI as soluble 
proteins under native conditions and assessed fibrillization by sedi- 
mentation analysis and electron microscopy. Wild-type hnRNPA2 
and hnRNPA1 were both intrinsically aggregation prone with a lag 
phase of ~4h (Fig. 4e, g). Electron microscopy revealed that wild-type 
hnRNPA2 and hnRNPAI form fibrils (Fig. 4f, h). The pronounced lag 
suggests that a critical rate-limiting step in hnhRNPA2 and hnRNPA1 
fibrillization is nucleation. Indeed, a small quantity (5% wt/wt) of 
preformed hnRNPA2 or hnRNPAI fibrils greatly accelerated the 
assembly of hnRNPA2 or hnRNPAI, respectively (Supplementary 
Fig. 7a, e). By contrast, preformed hnRNPA2 fibrils did not seed 
assembly of hnRNPA1 (Supplementary Fig. 7e). Likewise, pre-formed 
hnRNPAI fibrils did not seed assembly of hnRNPA2 (Supplementary 
Fig. 7a). Thus, wild-type hnRNPA1 and hnRNPA2 spontaneously 
form self-seeding fibrils. 

Importantly, the disease mutations greatly accelerated hnRNPA2 
and hnRNPAI fibrillization (Fig. 4e-h). In all cases, the lag phase was 
curtailed and fibrillization was well advanced while the wild-type 
protein remained in lag phase. Thus, the disease-causing mutations 
directly promote nucleation of hnRNPA2 and hnRNPA1 into fibrils. 
Moreover, fibrils formed by hnhRNPA2(D290V), hnRNPA1(D262V) 
and hnRNPA1(D262N) not only seeded their own assembly 
(Supplementary Fig. 7b, f, g), but also promoted fibrillization of their 
respective wild-type counterparts (Supplementary Fig. 7d, i). Neither 
hnRNPA2(D290V) fibrils nor hnaRNPA1(D262V) fibrils seeded the 
assembly of TDP-43, another RNA-binding protein with a PrLD. 
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Figure 3 | The disease mutations affect a PrLD in 
hnRNPA2B1 and hnRNPAI. a, b, FoldIndex 
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Finally, we assessed the importance of the steric zipper motifs in 
fibrillization. We deleted the steric zipper residues 287-292 from 
hnRNPA2 and 259-264 from hnRNPAI and assessed _fibrilliza- 
tion in vitro. Importantly, neither hnRNPA2(A287-292) nor 
hnRNPA1(A259-264) formed fibrils (Fig. 4e, g), even after 24h of 
agitation. Moreover, hnhRNPA2(A287-292) did not fibrillize when 
seeded by fibrils of wild-type or mutant hnaRNPA2 (Supplementary 
Fig. 7c), nor did hnRNPA1(A259-264) fibrillize when seeded by 
fibrils of wild-type or mutant hnRNPA1 (Supplementary Fig. 7h). 
Thus, residues 287-292 of hnaRNPA2 and 259-264 of hnRNPAI are 
critical for spontaneous and seeded fibrillization of the full-length 
protein. Collectively, these findings indicate that steric zipper motifs 
centred in the PrLDs of wild-type hnaRNPA2 and hnRNPAL are criti- 
cal to their intrinsic tendency to fibrillize and that disease mutations 
introduce more-potent steric zippers, which accelerate nucleation and 
polymerization (Supplementary Fig. 6). 


Mutant hnRNPA2BI1 PrLD supports yeast prion activity 


Yeast prion proteins are generally modular, meaning that prion 
domains from one protein can be transferred to another, while retain- 
ing prion activity’*. To determine whether the PrLD of hnaRNPA2B1 
could support prion activity in yeast, we replaced the Sup35 nucleation 
domain (residues 3-40) with the core PrLD (residues 261-303) from 
either wild-type or mutant hnhRNPA2B1 and expressed these fusion 
proteins as the sole copies of Sup35 in the cell. Prion formation was 
detected by monitoring nonsense suppression of the ade2-1 allele and 
verified by monitoring a series of prion-defining criteria, including 
curability with guanidine, transferability with cytoduction and sensi- 
tivity to expression levels (detailed in Supplementary Fig. 8). This 
analysis showed that the hnRNPA2B1 core PrLD can substitute for 
the Sup35 nucleation domain in supporting prion formation, and that 
the D290V mutation specifically promotes the nucleation activity. 
Moreover, full-length hnRNPA1 and hnRNPA2 form cytoplasmic 
aggregates and are toxic in yeast (Supplementary Fig. 9). 


Mutations enhance hnRNP recruitment to RNA granules 


Remarkably, the PrLDs in hnRNPA2B1 and hnRNPAI correspond to 
the ‘low-complexity sequences’ found in various hnRNPs (including 
TDP-43 and FUS) that are essential determinants of RNA granule 
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curve below the dashed green line). All curves 
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in ref. 15. The disease mutations were predicted to 
make these domains more prionogenic (insets). 

c, d, ZipperDB-detected six-amino-acid stretches 
(underlined in e, f) within the core PrLDs for which 
the disease mutations increased the predicted 
amyloid-fibril-forming potential beyond the 
Rosetta threshold. e, f, Domain architecture of 
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recognition motifs 1 and 2 (RRM1 and RRM2), the 
C-terminal glycine-rich domain and an M9 
nuclear-localization signal. The PrLDs are centred 
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assembly”. Stress granules are cytoplasmic ribonucleoprotein gran- 
ules composed of repressed translation complexes**’. TDP-43 and 
FUS are recruited to stress granules and this is enhanced by disease 
mutations”. Thus, we proposed that hnhRNPA2B1 and hnRNPA1 
would be recruited to stress granules and that this would be enhanced 
by disease mutations. To test this, we first examined the subcellular 
localization of endogenous hnRNPA2B1 in HeLa cells before and 
after stress granule induction with arsenite. At baseline, endogenous 
hnRNPA2B1 was localized exclusively to nuclei (Supplementary 
Fig. 10). After arsenite treatment hnRNPA2B1 re-localized to 
eIF4G-positive cytoplasmic puncta, showing the recruitment of endo- 
genous hnRNPA2B1 to stress granules (Supplementary Fig. 10). 

To examine the impact of disease mutations, we expressed Flag- 
tagged versions of wild-type and mutant hnRNPA2. There was signi- 
ficantly greater incorporation of mutant hnRNPA2 into constitutive 
stress granules than wild-type hnRNPA2 (Fig. 5a). Moreover, after 
arsenite treatment mutant hnRNPA2 was incorporated into stress 
granules more rapidly than wild-type hnRNPA2, although by 
30 min their degree of incorporation was similar (Fig. 5b). Both 
wild-type and mutant hnRNPA2 formed high-molecular-weight 
species that accumulated in the detergent-insoluble fraction (Fig. 5c). 
Similar to the kinetics of stress granule incorporation, mutant hnRNPA2 
accumulated as high-molecular-weight species more rapidly than 
wild-type hnRNPA2, although by 30 min their accumulation was com- 
parable (Fig. 5c). Similar results were obtained upon examination of 
endogenous hnRNPAI as well as exogenous wild-type and mutant 
hnRNPAI (Supplementary Fig. 11). Interestingly, hnaRNPA2-positive 
and hnRNPA1-positive stress granules were also immunopositive for 
TDP-43 (Supplementary Fig. 12). We also generated fibroblast cell 
lines from patients [14 and II6 (family 1). In these cells endogenous 
mutant hnRNPA2 accumulated in constitutive stress granules that 
were immunopositive for eIF4G in addition to TDP-43 and VCP 
(Supplementary Fig. 13). 


Mutant hnRNPs form pathological inclusions in vivo 

We generated transgenic Drosophila expressing wild-type or mutant 
forms of human hnRNPA2 and hnRNPAI, as well as wild-type or 
mutant forms of the fly homologue HRB98DE (Supplementary 
Figs 14 and 15). Expression of wild-type hnRNPA2 in Drosophila 
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Figure 4 | Disease mutations accelerate hnRNPA2 and hnRNPAI 
fibrillization. a, Synthetic hexapeptides A2 wild-type (Asn-Tyr-Asn-Asp- 
Phe-Gly) or mutant (Asn-Tyr-Asn-Val-Phe-Gly) were incubated at 25 °C for 
2h. Fibrillization was monitored by thioflavin T (ThT) fluorescence. b, Electron 
micrograph of A2 wild-type or mutant hexapeptides after 10 min at 25 °C. Scale 
bars, 0.1 tum. ¢, d, Fibrillization analysis of Al wild-type (Ser-Tyr-Asn-Asp- 
Phe-Gly) or mutant (Ser-Tyr-Asn-Val-Phe-Gly) as in a, b. Scale bars, 0.5 jum. 
e, Full-length hnRNPA2 wild-type, hnaRNPA2(D290V) or hnRNPA2(A287- 
292) was incubated at 25 °C with agitation for 0-12 h. At various times, the 
amount of aggregated hnRNPA2 was determined. Values represent 

means ~ s.e.m. (n = 3). f, Electron micrograph of hnRNPA2 fibrillization 
reactions after 0, 4 and 12h at 25°C. Note the absence of fibres after 4h for 
wild-type hnRNPA2. Scale bar, 0.5 jum. g, h, Fibrillization of full-length 
hnRNPAI wild-type, hnRNPA1(D262V), hnRNPA1(D262N) or 
hnRNPA1(A259-264), monitored as in e, f. Scale bar, 0.5 um. 


indirect flight muscle led to mild degeneration affecting the rostral 
portion of several muscles, whereas expression of hnRNPA2(D290V) 
caused severe degeneration that affected all muscles (Fig. 6a). 
Importantly, flies overexpressing hnRNPA2(A287-292), in which 
the hexapeptide (Asn-Tyr-Asn-Asp/Val-Phe-Gly) in the PrLD was 
deleted, had virtually normal muscles (Fig. 6a). This result indicates 
that the severe toxicity found in flies overexpressing mutant 
hnRNPA2 requires the presence of this potent steric zipper motif. 

Immunohistochemical analysis showed that wild-type hnRNPA2 
localized appropriately to nuclei, as did hnRNPA2(A287-292), whereas 
hnRNPA2(D290V) largely accumulated in cytoplasmic inclusions 
(Fig. 6b). We observed a correlation between cytoplasmic inclu- 
sion formation and solubility of hnRNPA2. Specifically, mutant 
hnRNPA2(D290V) protein was largely recovered from the detergent- 
insoluble fraction, whereas wild-type hnaRNPA2 and hnRNPA2(A287- 
292) proteins were mostly found in the detergent-soluble fraction 
(Fig. 6c, d). Thus, the degree of muscle degeneration in flies expressing 
hnRNPA2 correlates with the extent of cytoplasmic inclusions and 
with hnRNPA2 solubility. Entirely consistent results were obtained 
upon expression of wild-type and mutant versions of hnhRNPA1 and 
HRB98DE (Supplementary Fig. 15). 

Last, we evaluated the impact of the disease mutation on hnaRNPA2 
cellular localization in mammalian muscle. We electroporated mouse 
tibialis anterior with plasmid expressing Flag-tagged versions of 
wild-type or mutant hnRNPA2. Wild-type hnRNPA2 was localized 
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Figure 5 | hnRNPA2 recruitment to stress granules is accelerated by disease 
mutation. a, HeLa cells were transfected with Flag-tagged wild-type or mutant 


hnRNPA2 and stained with anti-Flag (green), anti-eIF4G (red) and DAPI 
(blue). Arrows indicate hnaRNPA2- and eIF4G-positive stress granules. Scale 
bar, 20 um. b, HeLa cells were transfected as in a, treated with 0.5 mM sodium 
arsenite for the indicated time, and immunostained as in a. The percentage of 
cells displaying hnRNPA2-positive stress granules (SGs) at indicated time points 
after treatment with arsenite cells are plotted. Data represent mean =~ s.e.m. 
n= 3; ***P <0.001. c, HeLa cells were transfected and stimulated as in b, and 
sequentially extracted with RIPA (radio immunoprecipitation assay) buffer and 
urea buffer. Immunoblotting was conducted with anti-Flag, anti-TDP-43 or 
anti-actin (loading control) antibodies. 


appropriately within nuclei, whereas mutant hnaRNPA2(D290V) was 
excluded from nuclei and accumulated in cytoplasmic inclusions 
(Supplementary Fig. 16), highly reminiscent of the pathology seen 
in MSP patients. 


Discussion 

Polymerization of PrLDs underlies the orderly, regulated phase trans- 
ition that drives assembly of non-membrane-bound organelles includ- 
ing RNA granules, which serve as ‘reaction crucibles’ in which many 
aspects of RNA metabolism occur”’. Disease mutations introduce more 
potent steric zippers into the PrLDs of hnRNPA2 and hnRNPAI, dysre- 
gulating and accelerating nucleation and polymerization, altering the 
dynamics of RNA granule assembly, which likely has adverse con- 
sequences for RNA metabolism. In addition to hnRNPA2 and 
hnRNPAI, at least four other RNA-binding proteins that harbour 
PrLDs (TDP-43, FUS, EWSR1 and TAF15) accumulate in disease 
pathology”’. By analogy to hnRNPA2B1 and hnRNPAI, we speculate 
that these and perhaps other PrLD-containing proteins contribute to 
the initiation or propagation of disease by a similar mechanism. 
Indeed, ~250 human proteins are predicted to harbour PrLDs, many 
of which are RNA-binding proteins (Supplementary Fig. 5 and Sup- 
plementary Table 3), and should be considered candidates for contri- 
buting to degenerative disease. 

Thus, disease could ensue from unregulated polymerization initiated 
spontaneously by PrLDs upon environmental stress, or by a mutation 
affecting either the PrLD or another factor (for example, VCP) that 
regulates the ribonucleoprotein granule assembly with altered meta- 
bolism of RNA as a consequence. 
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Figure 6 | Mutant hnRNPA2 forms cytoplasmic inclusions in Drosophila. 
a, Adult flies were dissected to expose the dorsal longitudinal indirect flight muscle 
and stained with Texas Red-phalloidin (red) and DAPI (blue). Flies expressing 
human wild-type hnRNPA2 under control of the Mhc-Gal4 driver showed mild 
degeneration, whereas flies expressing mutant human hnRNPA2 show severe 
degeneration affecting all muscles. Flies expressing hnRNPA2(A287-292) show 
muscle histology similar to flies expressing wild-type hnRNPA2. Ntg indicates 
non-transgenic control. b, Wild-type hnRNPA2 localizes exclusively to nuclei, 
whereas hnRNPA2(D290V) also accumulates extensively in cytoplasmic 
inclusions. hnRNPA2(A287-292) localizes exclusively to nuclei. c, Thoraces of 
adult flies were dissected and sequential extractions were performed to examine 
the solubility profile ofhnRNPA2. d, Quantification of the blot shown in c. Data 
represent mean ~ s.e.m. n = 3; *P < 0.05, ***P < 0.001; two-way analysis of 
variance with Bonferroni’s post hoc test. 


Notably, diseases associated with pathological inclusions of PrLD- 
containing proteins frequently show ‘spreading’ pathology, in which 
cellular degeneration with intracellular inclusions starts in one epi- 
centre and subsequently spreads to neighbouring tissue**. Although 
not directly addressed here, this study suggests that cell-to-cell trans- 
mission of a self-templating conformer could contribute to the 
spreading pathology that is characteristic of these diseases. 

The discovery of VCP mutations in MSP’ led to the subsequent 
discovery of pathogenic VCP mutations in sporadic or familial forms 
of ALS’, FTD*, IBM° and PDB®. Therefore, subsequent to the dis- 
covery of mutations in hnRNPA2B1 and hnRNPAI underlying 
MSP, we screened 212 familial ALS cases and identified a pathogenic 
hnRNPA1 mutation in one family, as described earlier. As this manu- 
script was being assembled for publication we completed screening of 
305 sporadic ALS cases and identified a non-synonymous variant in 
hnRNPAI (c.800/956A>G, p.N267S/N319S) in one classic, late- 
onset case in which mutations in known ALS genes had been excluded 
(Supplementary Fig. 17). Although not definitive, this variant is pro- 
bably pathogenic because it is centred in the core PrLD and introduces 
a potent steric zipper similar to that introduced by D262V/N muta- 
tions. The frequency of hnRNPA2B1 and hnRNPAI mutations in 
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ALS, and the possibility that mutations in these genes underlie some 
sporadic and familial forms of FTD, IBM and PDB, will be important 
to address in the future. 


METHODS SUMMARY 


Exome enrichment of 3 1g genomic DNA was performed by using the Agilent 
SureSelect Human All Exon Kit capture library (G3362 for five DNA samples 
from family 1 and G3370 for five DNA samples from family 2) according to the 
manufacturer’s protocol (SureSelect Human All Exon Illumina Paired-End pro- 
tocol version 1.0.1). Captured DNAs were sequenced on an Illumina GAIIx 
sequencer (Illumina) with 76-base-pair (family 1) or 100-base pair (family 2) 
paired-end reads. At least two lanes of sequencing data were collected for each 
sample to generate sufficient coverage. Image analyses and base calling were 
performed by using the Illumina Genome Analyzer Pipeline software (GAPipeline 
version 1.5 or higher) with default parameters. Reads were aligned to a human 
reference sequence (University of California, Santa Cruz (UCSC) assembly hg19, 
National Center for Biotechnology Information (NCBI) build 37) and genotypes 
were called at all positions at which there were high-quality sequence bases (Phred- 
like score Q25 or greater) at minimum coverage of 8, using CLC Genomics 
Workbench v4.5.1 (CLC Bio). For each sample, more than 50% of at least 100 
million reads were uniquely mapped to the targeted human exon regions to give 
mean depth of coverage of 123. For linkage analysis samples were genotyped using 
the Affymetrix Genome-Wide Human SNP Array 6.0. Single nucleotide poly- 
morphism genotype calls were performed according to the standard Affymetrix 
protocols. A subset of single nucleotide polymorphisms chosen on the basis of 
heterozygosity was used for linkage analysis, using dChip software (http:// 
www.hsph.harvard.edu/cli/complab/dchip/) under parametric linkage analysis 
and a dominant model with 99% penetrance in heterozygotes and a disease allele 
frequency of 0.001. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Patients. Patients were examined by M.B. and J.P.T. except where indicated. The 
Institutional Review Board of the University of Miami approved the study pro- 
tocol and all participants provided written informed consent. 

Exome capture and next-generation sequencing. Exome enrichment of 3 1g 
genomic DNA was performed by using the Agilent SureSelect Human All 
Exon Kit capture library (G3362 for five DNA samples from family 1 and 
G3370 for five DNA samples from family 2) according to the manufacturer’s 
protocol (SureSelect Human All Exon Illumina Paired-End protocol version 
1.0.1). The capture library, containing regions totalling approximately 38 mega- 
bases (G3362) or 50 megabases (G3370), is designed to target all human exons in 
the NCBI consensus coding sequence (CCDS) database. Captured DNAs were 
sequenced on an Illumina GAIIx sequencer (Illumina) with 76-base-pair (family 
1) or 100-base-pair (family 2) paired-end reads. At least two lanes of sequencing 
data were collected for each sample to generate sufficient coverage. Image ana- 
lyses and base calling were performed by using the Illumina Genome Analyzer 
Pipeline software (GAPipeline version 1.5 or higher) with default parameters. 
Reads were aligned to a human reference sequence (UCSC assembly hg19, NCBI 
build 37) and genotypes were called at all positions at which there were high- 
quality sequence bases (Phred-like score Q25 or greater) at minimum coverage of 
8, using CLC Genomics Workbench v4.5.1 (CLC Bio). For each sample, more 
than 50% of at least 100 million reads were uniquely mapped to the targeted 
human exon regions to give a mean depth of coverage of 123. Under such coverage, 
approximately 94% of targeted regions were covered by 5 reads or more, and more 
than 85% were covered by more than 20 reads (Supplementary Fig. 1). To identify 
the pathogenic mutations, we performed a series of filtering steps (Supplementary 
Fig. 1). We first discarded the variants that did not change the amino-acid 
sequence. We then generated a list of variants that are present in affected indivi- 
duals but absent in nonaffected individuals. Variants previously reported in the 
Single Nucleotide Polymorphism Database build 132 (dbSNP132) hosted by the 
National Center for Biotechnology Information were excluded. The variants 
were further filtered by using the in-house database (a collection of variants from 
whole-exome sequencing of more than 625 individuals) by a similar methodology. 
Finally Sanger sequencing was performed on DNA of three additional members 
(both affected and nonaffected) from each family to exclude the nonpathogenic 
mutations. Presence of mutations was confirmed in all candidate disease-causing 
variants by Sanger sequencing. 

Linkage analysis. Eight DNA samples (parents and six siblings) from family 1 
were genotyped using the Affymetrix Genome-Wide Human SNP Array 6.0. SNP 
genotype calls were performed according to the standard Affymetrix protocols. A 
subset of SNPs chosen on the basis of heterozygosity was used for linkage analysis, 
using dChip software (http://www.hsph.harvard.edu/cli/complab/dchip) under 
parametric linkage analysis and a dominant model with 99% penetrance in het- 
erozygotes and a disease allele frequency of 0.001. Five genomic segments with a 
size greater than 100 kilobases, at least 20 SNP markers, and a log odds ratio (lod) 
score greater than 0 are shown in Supplementary Fig. 1. 

Generation of phylogram for hnRNP A/B gene family. TBPH of D. melanoga- 
ster was used to BLAST (basic local alignment search tool) against the RefSeq 
protein database including species of M. musculus, D. melanogaster, A. gambiae, 
H. sapiens, D. rerio (zebrafish), C. elegans and G. gallus (jungle fowl). The top 100 
sequences (according to BLAST E values) were obtained. Sequences sharing more 
than 98% similarity were filtered by UCLUST* because they likely represent 
alternatively spliced variants of the same genes. Eighty-one sequences remaining 
after this process were aligned using the MAFFT multiple sequence alignment 
program’®. The DNA-coding sequences of all proteins were extracted and 
threaded onto the protein sequences to form an in-frame codon alignment. 
Squint” was used to extract the DNA-coding alignment of the two RRM domains 
present in each protein using the protein alignment as a guide. Phylogenetic trees 
of the codon alignment including 21 genes from only human and Drosophila were 
generated using the genetic algorithm-based GARLI** method. 

Constructs. Complementary DNAs encoding human hnRNPA2 (accession 
number NM_002137.2) or hnRNPAI (accession number NM_002136.1) in the 
plasmid pCMV6-XL5 were obtained from OriGene. A Flag epitope tag was added 
to the 5’ end of hnRNPA2 or hnRNPA1 by PCR. The cDNAs were then sub- 
cloned into a pcDNA 3.1 plasmid, using restriction sites EcoRI and Xhol, creating 
pcDNA 3.1-Flag-hnRNPA2 or -Flag-hnRNPA1. cDNAs encoding Drosophila 
HRB98DE (accession number NM_170373.1) were obtained from the Droso- 
phila Genomics Research Center. Missense mutations of hnRNPA2 (hnRNPA2 
(D290V)), hnRNPA1 (hnRNPA1(D262V)) and Drosophila HRB98DE (HRB98DE 
(D302V)) were generated by site-directed mutagenesis. Sequence analysis was 
conducted on all plasmid constructs. 

Antibodies. The following commercial antibodies were used in this study: 
mouse monoclonal anti-hnRNPA2B1 antibodies EF-67 and DP3B3 (Santa 


Cruz Biotechnology), rabbit polyclonal anti-TDP-43 antibody (Protein Tech 
Group), anti-eIF4G antibody (Santa Cruz Biotechnology), anti-hnRNPAI 
antibody (Cell Signaling Technology), anti-VCP antibody (Epitomics) and 
anti-Flag M2 antibody (Sigma). Anti-hnRNPAI antibodies (4B10 and 9H10) 
were generous gifts from G. Dreyfuss. Validation of these antibodies is shown 
in Supplementary Fig. 18. 

Cell culture and transfection. HEK293 cells and HeLa cells were grown in 
Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 10% fetal 
bovine serum (FBS), 1% penicillin/streptomycin, and 1% L-glutamate. Human 
fibroblasts were grown in DMEM nutrient mixture F-12 (DMEM/F12) supple- 
mented with 20% FBS, 1% penicillin/streptomycin and 1% L-glutamate. Cells 
were transfected using Lipofectamine LTX with Plus Reagent (Invitrogen), 
according to the manufacturer’s instructions. 

RNA interference. Short interfering RNAs (siRNAs) against hnaRNPA2B1 were 
ordered from Santa Cruz Biotechnology (sc-43841, target sequences: 5’- 
CAGUUCCGUAAGCUCUUUAtt, GGAUGGCUAUAAUGGGUAUtt-3’ and 
5'-GGAUCAUGGUGUAAUAAGAtt-3’) and Dharmacon (ON-TARGETplus 
SMARTpool, human HNRPA2B1 (3181), target sequences: 5'-CGGUGGA 
AAUUUCGGACCA-3’, 5'-GCUGUUUGUUGGCGGAAUU-3’, 5'-GGAGAG 
UAGUUGAGCCAAA-3’ and 5’-GAGGAGGAUCUGAUGGAUA-3’). Lower- 
case letters indicate non-targeting 3’-overhang included for stability. Control 
siRNAs were ordered from Dharmacon (ON-TARGETplus Non-targeting 
siRNA no. 1). HeLa cells were transfected in six-well plates with Lipofectamine 
RNAiMax (Invitrogen) according to the manufacturer’s protocol and collected 
after 48 and 72h. 

Immunofluorescence studies. Cells were fixed in 4% paraformaldehyde in PBS 
buffer, permeabilized with 0.5% Triton X-100 in PBS for 10 min, blocked with 5% 
goat serum in PBS for 30 min and incubated with primary antibody for 2h at 
room temperature (20-25 °C) or overnight at 4°C. Primary antibodies were 
visualized with secondary antibodies conjugated with Alexa Fluor 488 and 
Alexa Fluor 594 (Molecular Probes, Invitrogen) and nuclei were detected using 
DAPI. Stained cells were examined using a confocal microscope (Zeiss LSM 510 
NLO Meta) with Zeiss ZEN software. 

Solubility and biochemical analyses. Sequential extractions were performed to 
examine the solubility profile of hnRNPA2B1. Cells were washed twice with PBS, 
lysed in cold RIPA buffer (50 mM Tris, pH 7.5, 150 mM NaCl, 1% Triton X-100, 
0.5% sodium deoxycholate, 0.1% SDS and 1mM EDTA) and sonicated. Cell 
lysates were cleared by centrifugation at 100,000g for 30 min at 4 °C to generate 
RIPA-soluble samples. To prevent carry-overs, the resulting pellets were washed 
twice with PBS (that is, re-sonicated and re-centrifuged). Only supernatants from 
the first centrifugation were analysed. RIPA-insoluble pellets were then extracted 
with urea buffer (7 M urea, 2 M thiourea, 4% CHAPS (3-[(3-cholamidopropyl)- 
dimethylammonio]-1-propanesulphonate), 30 mM Tris, pH 8.5), sonicated and 
centrifuged at 100,000g for 30 min at 22 °C. Protease inhibitors were added to all 
buffers before use. Protein concentration was determined by the bicinchoninic 
acid method (Pierce) and proteins were resolved by NuPAGE Novex 4-12% Bis- 
Tris Gel (Invitrogen). 

In vivo electroporation of mice. All animal work was performed with the 
approval of the St Jude Children’s Research Hospital Committee on Animal 
Care. Mice were anesthetized and the skin overlying the tibialis anterior muscle 
was shaved, and the animals were injected with endotoxin-free Flag-tagged wild- 
type hnRNPA2 or Flag-tagged hnRNPA2(D290V) mutant diluted in sterile 
PBS to a volume of 50 il by using a 0.5-ml syringe fitted with a 29-gauge needle. 
Two-needle array electrodes (BTX Technologies) were inserted into the muscle 
immediately after DNA delivery for electroporation. The distance between the 
electrodes was 5mm, and the array was inserted longitudinally relative to the 
muscle fibres. In vivo electroporation parameters were as follows: voltage, 75 V; 
pulse length, 50 ms; number of pulses, 6; pulse interval, 200 ms; desired field 
strength, 200 V cm}, given by a BTX ECM830 Electro Square Porator. After 7 
days of recovery, mice were processed for immunofluorescence analysis. 
Muscle histochemistry and immunohistochemistry. Human samples were pro- 
cessed for routine histochemical analysis as previously described’’. For skeletal 
muscle immunohistochemistry, isolated skeletal muscle was mounted by using 
tragacanth gum and quick-frozen in liquid-nitrogen-cooled 2-methylbutane. 
Sections (8 jum) were placed on slides and fixed in 4% paraformaldehyde for 
5 min and then ice-cold acetone for 5 min. Sections were incubated with PNB 
blocking reagent (Perkin Elmer) for 1 h and then incubated overnight in primary 
antibody diluted in PNB. After serial washes in 1X PBS, slides were incubated in 
secondary antibody diluted in PNB for 2h. Sections were mounted with Mowiol 
containing DAPI. Specimens were examined using a fluorescent microscope (80i 
upright; Nikon) and charge-coupled device camera (EZ monochrome; Roper 
Industries) with deconvolution software analysis (NIS Elements; Nikon). 
Nonfluorescent images were taken with a 5-megapixel colour charge-coupled 


©2013 Macmillan Publishers Limited. All rights reserved 


device (Nikon). Image processing and analysis were performed with NIS 
Elements 4.0 software and Photoshop CS3 (Adobe). 

Yeast strains, media and plasmids. Yeast cells were grown in rich media (YPD) 
or in synthetic media lacking uracil and containing 2% glucose (SD/—Ura), 
raffinose (SRaf/—Ura) or galactose (SGal/—Ura). The TDP-43 and FUS yeast 
expression constructs have been previously described”~*!. Wild-type and 
IBMPFD-associated mutant hnaRNPA2B1 and hnRNPA1 Gateway entry clones 
were generated by PCR, incorporating the flanking Gateway attB1 and attB2 sites 
along with a Kozak consensus sequence. Resulting PCR products were shuttled 
into pDONR221, using a Gateway BP Clonase reaction. Two versions of each 
entry clone (with or without stop codon) were then used in LR reactions with 
pAG416Gal-ccdB, pAG426Gal-ccdB, pAG416Gal-ccdB-GFP or pAG426Gal- 
ccdB-GFP” to generate the CEN (centromeric) and 2-j1m untagged or YFP- 
fusion constructs. Primer sequences are available upon request. To prevent 
unwanted recombination events owing to unstable repetitive sequences in the 
hnRNPA2B1 and hnRNPA1 DNA sequence, we propagated plasmids in SURE-2 
or STBL-3 Escherichia coli at 30 °C. For testing the phenotype of wild-type and 
IBMPFD-associated mutant hnRNPA2 as Sup35-fusion proteins, PCR was used 
to amplify amino acids 261-303 of hnRNPA2 and insert it into Sup35 in the place 
of amino acids 3-40. The resulting PCR products were co-transformed with 
BamHI/HindIlI-cut pJ526 (from D. Masison, NIH; ref. 33) into strain 780-1D 
(MAT kar1-1 SUQ5 ade2-1 his3A202 leu2A1 trp1A63 ura3-52 sup35::KanMX) 
containing the SUP35 maintainer plasmid pJ533 (from D. Masison; ref. 34). 
Transformants were selected on medium lacking leucine, and then transferred 
to 5-fluoroorotic-acid-containing medium to select for loss of pJ533. Sequences 
were confirmed by DNA sequencing. Overexpression plasmids were generated 
by PCR amplification, restriction digest and cloning into compatible sites in 
pKT24 (from K. Taylor, NABI), which contains a GALI promoter, as previously 
described”. 

Yeast transformation and spotting assays. Yeast procedures were performed 
according to standard protocols*. The polyethylene glycol/lithium acetate 
method was used to transform yeast with plasmid DNA”. For spotting assays, 
yeast cells were grown overnight at 30 °C in SRaf/— Ura until they reached log or 
mid-log phase. Cultures were then normalized for Dgoo nm: serially diluted, spot- 
ted onto synthetic solid media containing SD/— Ura or SGal/— Ura, and grown at 
30 °C for 2-3 days. hnRNPA2B1-Sup35 fusions were tested for prion formation 
and for stability and curability of the Ade’ phenotype as previously described*>. 
Cytoductions were performed as previously described**. The recipient strain 
was YER746 (MATa karl-1 SUQS5 ade2-1 his3 leu2 trp1 ura3 argl::HIS3 
sup35::KanMx), carrying plasmid pER697 (URA3), which expresses the 
hnRNPA2(D290V)-Sup35 fusion from the SUP35 promoter. 

Fluorescence microscopy for yeast. For fluorescence microscopy experiments, 
single-colony isolates of yeast strains were grown to mid-log phase in SRaf/—Ura 
media at 30 °C. Cultures were spun down and re-suspended in the same volume 
of SGal/— Ura to induce expression of the hnRNPA2B1 or hnRNPAI constructs. 
Cultures were induced with galactose for 4-6h and processed for microscopy. 
Images were obtained by an Olympus IX70 inverted microscope and a 
Photometrics CoolSnap HQ 12-bit CCD camera. Z stacks of several fields were 
collected for each strain. The images were de-blurred using a nearest neighbour 
algorithm in the DeltaVision softWoRx software and representative cells were 
chosen. 

Hexapeptide assembly. The wild-type hnRNPA2 (Asn-Tyr-Asn-Asp-Phe-Gly), 
mutant hnRNPA2(D290V) (Asn-Tyr-Asn-Val-Phe-Gly), wild-type hnRNPA1 
(Ser-Tyr-Asn-Asp-Phe-Gly) and mutant hnRNPA1(D262V) (Ser-Tyr-Asn- 
Val-Phe-Gly) hexapeptides were synthesized at the Keck Biotechnology 
Resource Laboratory at Yale University School of Medicine. Al peptides were 
dissolved at 1 mM and A2 peptides at 5mM in 150mM KCl, 40mM HEPES- 
KOH, pH 7.4 and 1 mM dithiothreitol and used immediately for assembly reac- 
tions. Fibre-assembly reactions were monitored at Xex/em 440/482 nm, in the 
presence of 25 tM thioflavin T, at room temperature on a Tecan Safire’ or Tecan 
Infinite M1000 plate reader. Alternatively, reactions were processed for electron 
microscopy as for the full-length hnRNPs (see below). 

hnRNP purification. Wild-type and mutant hnRNPA2 or hnRNPAI1 were 
expressed and purified from E. colias GST-tagged proteins. Expression constructs 
were generated in pDuet to contain a TEV-cleavable site, resulting in a GST-TEV- 
hnRNP construct***'. GST-TEV-hnRNP was overexpressed in E. coli BL21- 
CodonPlus(DE3)-RIL cells (Agilent) and purified under native conditions using 
a glutathione-sepharose column (GE) according to the manufacturer’s instruc- 
tions. Proteins were eluted from the glutathione sepharose with assembly buffer 
(AB; 40mM HEPES-KOH, 150mM KCI, 5% glycerol, 20mM _ glutathione, 
pH7.4). Protein was centrifuged for 10 min at 16,100g, and supernatant was 
separated from pellet to remove any protein aggregates. Protein concentration 
was determined by Bradford assay (Bio-Rad) in comparison to BSA standards. 
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In vitro fibril-formation assay. For GST-tagged protein, aggregation was 
initiated by adding TEV protease (Invitrogen) to GST-TEV-hnRNPA2 (3 [1M) 
or GST-TEV-hnRNPAI (5 1M) in AB. Aggregation reactions were incubated at 
25°C for 0-12h with or without agitation at 1,200 r.p.m. in an Eppendorf 
Thermomixer. For self-seeded and cross-seeded reactions, hnRNPA2, 
hnRNPA2(D290V), hnRNPA1I or hnRNPA1(D262V) fibrils were assembled at 
5 uM for 24h with agitation at 1,400 r.p.m. in AB. These preformed fibrils then 
used to seed (5% wt/wt) the assembly of hnRNPA2 (2.5 1M), hnRNPA2(D290V) 
(2.5 uM), hnRNPA1I (5 uM), hnRNPA1(D262V) (5 4M) or hnRNPA1(D262N) 
(5 tM) in AB as indicated. Here, assembly reactions were agitated at 1,200 r.p.m. 
at 25 °C. For hnRNPA2 cross-seeding reactions the amount of preformed fibrils 
added was increased to 10% wt/wt and reactions were not agitated. In hnaRNPA1 
cross-seeding reactions, preformed fibrils were briefly sonicated to fragment 
fibrils before addition to the assembly reaction. For hnRNPAI cross-seeding 
reactions the amount of preformed fibrils added was 5% wt/wt and reactions 
were not agitated. For sedimentation analysis, samples were centrifuged at 
16,100g for 10 min at 25 °C. Pellet fractions were resolved by SDS-PAGE and 
stained with Coomassie blue. The amount of protein in the pellet fraction was 
determined by densitometry in comparison to known amounts of hnRNP. For 
electron microscopy of in vitro aggregation reactions, samples (10 ul) were 
adsorbed onto glow-discharged 300-mesh Formvar/carboncoated copper grid 
(Electron Microscopy Sciences) and stained with 2% (w/v) aqueous uranyl acet- 
ate. Excess liquid was removed, and grids were allowed to air dry. Samples were 
viewed by a JEOL 1010 transmission electron microscope. 

Fly stocks and culture. The wild-type hnRNPA2 and mutant hnRNPA2(D290V) 
cDNAs were subcloned into the pUASTattB plasmid, using restriction sites EcoRI 
and Xhol, creating pUASTattB-wild type hnRNPA2 or pUASTattB-mutant 
hnRNPA2(D290V). Wild-type HRB98DE and mutant HRB98DE(D302V) were 
subcloned into pUASTattB plasmid using NotI and Xhol sites. Flies carrying 
pUASTattB transgenes were generated by a standard injection and (C31 inte- 
grase-mediated transgenesis technique. To express a transgene in muscles, Mhc- 
Gal4 was used (from G. Marqués). All Drosophila stocks were maintained in a 
25 °C incubator with a 12-h day/night cycle. 

Adult fly muscle preparation and immunohistochemistry. Adult flies were 
embedded in a drop of OCT compound (Sakura Finetek) on a slide glass, frozen 
with liquid nitrogen and bisected sagitally by a razor blade. After fixing with 4% 
paraformaldehyde in PBS, hemithoraces were stained by Texas Red-X phalloidin 
(Invitrogen) and DAPI according to manufacturer’s instructions. Stained hemi- 
thoraces were mounted in 80% glycerol, and the musculature was examined by 
DMIRE2 (Leica, 10). For hnRNPA2 staining, hemithoraces were permeabilized 
with PBS containing 0.2% Triton X-100 and stained with anti-hnRNPA2B1 
(EF-67) antibody (Santa Cruz Biotechnology) and Alexa-488-conjugated second- 
ary antibody (Invitrogen). Stained muscle fibres were dissected and mounted in 
Fluormount-G (Southern Biotech) and imaged with a Marianas confocal micro- 
scope (Zeiss, X63) 

Fly western blotting. Thoraces of adult flies were prepared and ground in PBS 
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CLP1 links tRNA metabolism to 
progressive motor-neuron loss 
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CLP1 was the first mammalian RNA kinase to be identified. However, determining its in vivo function has been elusive. 
Here we generated kinase-dead Clp1 (Clp1*’*) mice that show a progressive loss of spinal motor neurons associated with 
axonal degeneration in the peripheral nerves and denervation of neuromuscular junctions, resulting in impaired motor 
function, muscle weakness, paralysis and fatal respiratory failure. Transgenic rescue experiments show that CLP1 
functions in motor neurons. Mechanistically, loss of CLP1 activity results in accumulation of a novel set of small RNA 
fragments, derived from aberrant processing of tyrosine pre-transfer RNA. These tRNA fragments sensitize cells to 
oxidative-stress-induced p53 (also known as TRP53) activation and p53-dependent cell death. Genetic inactivation 
of p53 rescues Clp1*’* mice from the motor neuron loss, muscle denervation and respiratory failure. Our experiments 


uncover a mechanistic link between tRNA processing, formation of a new RNA species and progressive loss of lower 


motor neurons regulated by p53. 


RNA molecules undergo co- and post-transcriptional processing, 
leading to mature, functional RNAs. In mammals and archaea 
CLP1 proteins are kinases that phosphorylate the 5’ hydroxyl ends 
of RNA‘. Human CLP1 is a component of the messenger RNA 3’- 
end cleavage and polyadenylation machinery**, and studies in yeast 
have postulated a function for CLP1 in coupling mRNA 3’-end pro- 
cessing with RNA polymerase II (Pol II) transcriptional termination’ *. 
Unique to mammals is the association of CLP1 with the tRNA splicing 
endonuclease (TSEN) complex’. TSEN proteins remove the intron 
present within the anticodon loop of numerous pre-transfer RNAs 
(pre-tRNAs), generating 5’ and 3’ tRNA exon halves’. Within the 
TSEN complex, CLP1 phosphorylates 3’ tRNA exons in vitro’, poten- 
tially contributing to tRNA splicing in mammals’. Although CLP1 
may participate in multiple RNA pathways and was the first mam- 
malian kinase identified that phosphorylates RNA, the in vivo function 
of CLP1 in mammalian cells has remained elusive. 

We report the generation and phenotypic analysis of CLP1 kinase- 
dead mice. These mice develop a progressive loss of lower motor 
neurons, resulting in fatal deterioration of motor function. We also 
show that inactivation of CLP1 kinase activity results in the accumula- 
tion of previously unreported tyrosine tRNA fragments that sensitize 
cells to activation of p53 in response to oxidative stress. 


Neonatal lethality of Clp1*’* mice 


To assess the in vivo function of CLP1, first we generated global Clp1- 
knockout mice. We never obtained any viable Clp1-null offspring, 
even when analysed at embryonic day (E)6.5, indicating very early 
embryonic lethality. Consequently, we generated mice carrying a 


single amino-acid change, lysine to alanine at position 127 (K127A), 
which is located within the Walker A ATP-binding motif (Sup- 
plementary Fig. la, b). This mutation abolishes CLP1 kinase activity’. 
Mice heterozygous for the K127A substitution (Clp1‘’*) were inter- 
crossed to generate homozygous offspring (Clp1‘““). Western blotting 
showed that the CLP1 K127A mutant protein was expressed at normal 
levels, and that the Clp1‘“ mutation impaired 5’ phosphorylation 
of a small duplex RNA substrate (Supplementary Fig. 1c, d). There- 
fore, we successfully generated a knock-in mouse expressing kinase- 
dead CLP1. 

Clp1*” K mice were born at a normal Mendelian ratio. However, 
on a C57BL/6 background, all Clp1”“ mice died within hours of 
birth, probably owing to respiratory failure (Fig. la, b). This pheno- 
type had complete penetrance (n >50). Embryos and newborn 
Clp1** mice had overtly normal lung development and morphogene- 
sis, as indicated by caveolin 1, surfactant A and surfactant C expres- 
sion (Supplementary Fig. 2). However, all newborn Clp1** mice and 
E18.5 embryos exhibited a lordotic body posture and dropping 
forelimbs, indicative of impaired motor functions (Fig. 1a and Sup- 
plementary Fig. 3a). Newborn Clp1** mice also showed reduced 
birth weight and were hyporesponsive to stimuli (Supplementary 
Fig. 3b); similar phenotypes in KIF1B-mutant mice have been 
ascribed to motorsensory neuronal defects’’. We therefore analysed 
neuromuscular junctions (NMJs) in the diaphragm. 

Control E18.5 embryos showed the characteristic innervation pat- 
tern of the phrenic motor nerve bundle and had normal NMJs, 
defined by colocalization of presynaptic terminals with postsynaptic 
clusters of acetylcholine receptors (AChRs) as well as the presence of 
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Figure 1 | Respiratory failure and impaired innervation of the diaphragm. 
a, b, Appearance (a) and lung histology (b; haematoxylin and eosin) of 
newborn Clp1*’~ and Clp1** littermates on a C57BL/6 background. Scale 
bars: top, 500 um; bottom, 100 pm. c, d, Whole-mount immunostaining of 
diaphragm muscle of E18.5 Clp1*’* and Clp1** littermates showing 
postsynaptic AChR clusters (red, «-bungarotoxin), and innervating motor 
axons and presynaptic nerve terminals (green, neurofilament/synaptophysin 
immunostaining). Asterisks indicate defasciculated main axons mislocalized to 
the periphery of the muscle; arrowheads indicate secondary branches coming 
from the main axon; arrows indicate areas of denervation. Magnifications in 
d indicate severely disturbed NMJs in E18.5 Clp1‘ embryos (arrow). Scale 
bars: ¢, 250 tum; d, 25 fm. 


$100° Schwann cells in the endplate (Fig. 1c and Supplementary Fig. 4). 
E18.5 Clp1\’* embryos showed defasciculation of the main phrenic 
nerve bundle; primary branches were mislocalized to the periphery, 
and denervation of the ventral and dorsal diaphragm was prominent in 
all Clp1’ K mutants (Fig. 1c). NMJs were formed, but axon terminals 
appeared undifferentiated, with smaller AChR clusters (Fig. 1d). S100 
expression at the NMJ was absent in Clp1“”“ embryos, although the 
Schwann cells seemed to be functionally intact as surviving peripheral 
axons were myelinated (Supplementary Fig. 4). The development and 
morphology of the heart, liver, kidney, colon, bladder, spleen and 
thymus appeared normal at E18.5. Thus, all newborn Clp1‘““ mice 
show impaired innervation of the diaphragm, which seems to cause 
lethal respiratory failure and neonatal death. 


Embryonic loss of motor neurons 

We next assessed NMJs in the diaphragm during embryogenesis. In 
Clp1*’ K embryos, denervation was not found at E14.5. However, we 
observed partial denervation and pronounced alteration in NMJ 
morphology at E16.5, followed by a severe defect in the innervation 
of the NMJs of the diaphragm at E18.5 (Fig. 1c, d and Supplementary 
Figs 4-6). Moreover, whereas Clp1”“ embryos had normal numbers 
of choline acetyl transferase (ChAT)-expressing spinal motor neurons 
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at E14.5 and E16.5, the numbers of ChAT* motor neurons markedly 
declined in the spinal cord of E18.5 Clp1‘’* embryos (Fig. 2a, b and 
Supplementary Fig. 7a, b). The numbers of motor neurons also 
declined in wild-type embryos due to pruning, which has been linked 
to oxidative stress exposure’’. NeuN staining to detect all neurons 
showed that the observed reduction in neuronal numbers was due to 
the loss of ChAT* motor neurons (Supplementary Fig. 7c). 

To confirm loss of motor neurons in the spinal cord, we crossed 
green fluorescent protein ads tagged Hb9 (also known as Mnx1) 
transgenic mice onto a Clp1“ background. Genetic GFP tag- 
ging showed that E14.5 Clp1‘““ embryos have similar numbers of 
Hb9* cells as age-matched control embryos, but the numbers 
of Hb9-GFP* cells declined in E18.5 Clp1“”* embryos (Supplemen- 
tary Fig. 7d). In whole-mount visualizations of Hb9-GFP* E10.5 
embryos, both development and segmental-motor-axon outgrowth were 
comparable between Clp1‘* and control embryos; three-dimensional 
reconstructions further demonstrated comparable motor-axon out- 
growth in the thoracic region (segment T6-T7) in E12.5 embryos 
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Figure 2 | Viable Clp1*““ embryos develop neuromuscular atrophy. 

a, Immunostaining for ChAT* (red) motor neurons in the cervical (C4—C5) 
spinal cord of E14.5 and E18.5 Clp1*’* and Clp1** littermates on a C57BL/6 
background. 4’ ,6-Diamidino-2-phenylindole (DAPI) staining is shown. Scale 
bar: 50 um. b, Mean numbers + standard deviation (s.d.) of ChAT* motor 
neurons in the cervical (C4—C5) spinal cord of E14.5, E16.5 and E18.5 Clp1*/* 
and Clp1* littermates. n = 5 mice per group. ***P < 0.001 (t-test). c, Muscle 
weakness (impaired spreading of hind legs) in 3-month-old Clp1* mice. 

d, e, Impaired walking strides in 1-, 3-, 8- and 12-month-old Clp1* mice 
compared to Clp1*’* littermates. Representative images in d are from 12- 
month-old mice, demonstrating shortened walking strides and paralysis in 
Clp1*’* mice. Data in e are mean values + s.d. n = 5 mice per group. 

***P < 0.001 (t-test). f, g, Progressive impairment of motor functions in viable 
Clp1** mice as assessed by the latency to fall in a fixed (f) and accelerated 
(g) Rotarod test. Data (mean values + s.d.) are from 1-month-old (mo) 
Clp1'’* (n = 8) and Clp1* (n = 7) mice, and 4-month-old Clp1’* (n= 16) 
and Clp1** (n = 14) mice. **P < 0.01, ***P < 0.001 (t-test). 
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(Supplementary Fig. 8a, b). Moreover, the overall morphology of 
motor terminal branches and the pathfinding of motor neurons at 
the intercostal muscles was comparable (Supplementary Fig. 8c). 
Thus, genetic inactivation of the RNA kinase function of CLP1 results 
in progressive loss of motor neurons in the cervical and lumbar spinal 
cord, with aberrant innervations and formation of NMJs in the dia- 
phragm, and neonatal death. 


Progressive loss of motor functions 


The 100% lethality of Clp1‘’* pups at birth was observed on a C57BL/6 
background. Similarly, we observed 100% lethality on a BALB/c back- 
ground. However, when crossing the mutation onto a CBA/J mouse 
background, we obtained viable pups homozygous for the Clp1““ 
mutation (Supplementary Fig. 9a). These mice showed motor ataxia 
(Supplementary Fig. 9b), impaired muscle strength, an altered walking 
stride, and diminished balance as determined by the fixed and acce- 
lerated Rotarod tests (Fig. 2c-g), all indications of motor defects. The 
muscle weakness and impaired motor functions observed in these mice 
were progressive (Fig. 2e-g). Mice started to die around week 23 after 
birth, and older Clp1““ mice developed limb paralysis (Fig. 2d and 
Supplementary Fig. 9a, c). Sensory responses to noxious heat, mecha- 
nical stimulation and capsaicin-induced pain appeared normal in 
Clp1‘“* littermates (Supplementary Fig. 10). 

Whereas Clp1‘“* neonates on the CBA/J background had appar- 
ently normal numbers of spinal motor neurons, 4-month-old Clp1‘“* 
mice showed both a marked reduction in ChAT* and, as a second 
marker, SMI-32* spinal motor neurons (Fig. 3a, b and Supplemen- 
tary Fig. 11). The effects of the Clp1 mutation on upper motor neurons 
need to be assessed. Consistent with lower motor neuron loss, we 
observed axonopathy of peripheral nerves such as the sciatic nerve 
(Fig. 3c, d and Supplementary Fig. 12a—c). Immunostaining showed 
amyloid-f precursor protein (APP)-positive structures, a marker for 
axonal injury", in the sciatic nerves of adult Clp1*”* mice (Fig. 3e). In 
line with motor-neuron loss, we observed a marked reduction in the 
number of large diameter (type 1A alpha) fibres, whereas the smaller 
sensory fibres were largely preserved (Supplementary Fig. 12d). 
Dorsal root ganglion (DRG) sensory neurons showed normal mor- 
phology and outgrowth. Whether developmental alterations in mye- 
lination also contribute to the phenotype needs to be further assessed. 
We also observed regional denervation and fragmentation of NMJs 
in the diaphragm, and in various limb and head skeletal muscles, 
resulting in skeletal muscle atrophy; slow-twitch muscles (soleus, 
gluteus) were less affected than the fast-twitch extensor digitorum 
longus or gastrocnemius muscles (Fig. 3f, g and Supplementary Figs 
13-15). Similar results were obtained in SOD1 transgenic mice’. 
Thus, Clp1‘“* mice develop progressive loss of spinal motor neurons 
and exhibit defective NMJs, resulting in impaired motor functions, 
muscular atrophy and limb paralysis. 


CLP1 promotes efficient tRNA exon generation 


To assess potential roles in RNA metabolic processes, first we eval- 
uated the function of CLP1 in mRNA 3’-end cleavage’. The kinetics of 
a pre-mRNA cleavage reaction in nuclear extracts from wild-type and 
Clp1” K mouse embryonic fibroblasts (MEFs) were similar (Sup- 
plementary Fig. 16a), suggesting that ATP binding and/or hydrolysis 
by CLP1 are dispensable for mRNA 3’-end cleavage. CLP1 has been 
implicated in RNA interference (RNAi) by its ability to phosphorylate 
small interfering RNAs in vitro’. We therefore assessed whether 
cellular microRNA (miRNA) processing or stability were affected 
by the CLP1 K127A mutation. However, the levels of various pre- 
and mature miRNAs did not differ between wild-type and Clp1‘“ 
MEFs and tissues, and no effect on miRNA function was detected 
using a luciferase reporter assay (Supplementary Fig. 16b, c). Further- 
more, we found no significant differences in pre-mRNA splicing 
(Supplementary Fig. 17). 
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Figure 3 | Progressive loss of lower motor neurons. a, Immunostaining for 
ChATT (green) motor neurons in the lumbar (L5) spinal cord of 4-month-old 
Clp1*’* and Clp1** littermates on a CBA/J background. DAPI 
counterstaining. b, Mean numbers + s.d. of ChAT* motor neurons in the 
lumbar (L5) spinal cord of 1-day (d) and 4-month-old (mo) Clp1 */* and 
Clp1* littermates. n = 5 mice per group. ***P < 0.001 (t-test). c, Semi-thin 
cross-section of the sciatic nerve of 4-month-old Clp1*’* and Clp1* 
littermates. Toluidin blue staining. d, Mean numbers + s.d. of total nerve fibres 
in the sciatic nerves of 16-day and 4-month-old Clp1*’* and Clp1*’* 
littermates. n = 3 mice per group. *P < 0.05 (t-test). e, Lmmunohistochemical 
detection of APP (arrow) in the sciatic nerve of a 12-month-old Clp 1“ mouse. 
f, g, Whole-mount immunostaining depicting NMJs in the diaphragms of 
5-month-old Clp1*’* and Clp1* littermates. Postsynaptic AChR clusters are 
stained with «-bungarotoxin (red) and Schwann cells are labelled with anti- 
$100 antibodies (green). Arrows in g show NMJ fragmentations. Scale bars: 
a, 50 um; c, 200 pm; e, 100 tum; f, 25 um; g, 100 um. 


It has been reported that CLP1 is part of the human TSEN complex’ 
and is able to phosphorylate tRNA 3’ exons’. We therefore tested 
whether the Clp1‘* mutation affects tRNA splicing. Notably, nuclear 
extracts from Clp1‘““ MEFs generated significantly lower levels of 
tRNA exons (Fig. 4a). In addition, tRNA 3’ exon halves formed by 
incubation with nuclear extracts isolated from Clp1“’“ MEFs lacked a 
5’ phosphate group; 5’ phosphorylation and exon generation were 
rescued by ectopic expression of wild-type CLP1 in Clp1“”“ MEFs 
(Supplementary Fig. 18a—e). Similarly, pre-tRNA cleavage was par- 
tially restored in Clp1‘““ MEEs stably expressing wild-type CLP1, 
whereas overexpression of the CLP1 K127A mutant inhibited 
RNA phosphorylation (Supplementary Fig. 19a-d). Affinity-purified 
CLP1(K127A)-containing TSEN complexes were deficient in the 
generation of tRNA exons, most probably due to reduced levels of 
TSEN2, TSEN34 and TSEN54 subunits (Fig. 4b, c); mRNA levels of 
TSEN2, TSEN34 and TSEN54 were not affected in Cher K MEFs 
(data not shown). Thus, the RNA kinase activity of CLP 1 is important 


©2013 Macmillan Publishers Limited. All rights reserved 


at 
’ Fs + Y 
J C CANIN 
> ote a efe™ 
ST RY EY 
a b 2 See, 
3 Clpt* Clp1** £ 15 30 60 15 30 60 15 30 60 15 30 60 min 
£ 1530 60 153060 min [— Pre-tRNA 
Pre-tRNA 
——— ~~ |— Mature tRNA 
| /~3’ exon 
a ad = By . me ; 
Ls exon 5’ exon 
== a 
5’ exon 
-— Intron 
a 
CaS 
NogN 
NN 
c we co d=simm. Prim. — Spinal cord Spinal cord 
kDa MEFs MEFs (CBA/J) (C57BL/6) 
50 a TAP-myc-CLP 1 $F gggeeceEs 
75 BeaaRaERRER ORES 
so a] HstsEN2 SOOO FCGCOSFC OOO SG 
a. nt 
= Rererces 
HsTSEN34 2 errr as | tRNA (Tyr) 
a 4 i 
59 SE HsTSENS4 i . | [Mature 
tRNA (Tyr) 
60 4 PR i ae ek ee oe 
3 504 
e G 3 
A 7 
sppp UPI Sg le ee ae me Etyrtarnn 
Sou U U fragment 
ws us 
og ioe 
coe un 
U 
we soce [eee cleoeeoonn 
a%e Uage 
A a CG f Clp 1+ Clp1** 
rect A es 
cay acon @esaes 
c G U 6 €G gs € 
UR Uys 26982882 
S nS 53% 5 3% 4% 
y! y! = 
5’ Leader exon 5’ Leader exon ee eae us 


Figure 4 | Identification of a novel tRNA fragment. a, The RNA kinase 
activity of CLP1 is required for efficient tRNA exon generation in vitro. An 
internally labelled intron-containing yeast pre-tRNA’* was incubated with 
nuclear extracts from immortalized Clp1*’* and Clp1‘“ MEFs. Pre-tRNA 
processing was monitored by electrophoresis. b, One-step purified human (Hs) 
TAP-myc-HsCLP1 (wild-type (WT) or K127A) and TAP-myc-HsTSEN2 
complexes were assayed for pre-tRNA cleavage after elution with tobacco etch 
virus (TEV) protease (TEV eluate) by incubation with an internally labelled 
pre-tRNA’, Eluates from HeLa cells expressing the vector without insert were 
used as control. c, The level of TSEN proteins in TAP-myc-HsCLP1 (wild type 
and K127A) TEV eluates was examined by western blot. Similar CLP1 protein 
levels were confirmed using anti-myc antibodies. d, ~41-46-nucleotide (nt) 
Tyr-tRNA fragments accumulate in primary (Prim.) and immortalized (Imm.) 
Clp1* MEFs and spinal cords of 4-month-old CBA/J Clp1** and newborn 
CS57BL/6 Clp1’* mice. Northern blot analyses of total RNA were performed 
with a DNA/LNA probe complementary to the 5’ exon of Tyr-tRNA. U6 RNA 
served as a loading control. e, Secondary structure of the two most abundant 
tRNA fragment species, Tyr-tRNA M. musculus chr14.trnal91-TyrGTA and 
M. musculus chr13.trna958-TyrGTA (mouse July 2007 (mm9) genome 
assembly), as predicted by RNAfold (Vienna RNA Secondary Structure 
Package). f, Detection of Tyr-tRNA fragments from Clp1*’* and Clp1‘“X MEFs 
left untransfected (Untransf.) or transfected with control siRNAs or siRNAs 
against TSEN2. 


for the integrity of the TSEN complex and for efficient generation of 
tRNA exons (Supplementary Fig. 20). 


Accumulation of novel tyrosine tRNA fragments 


Whereas most tRNAs do not contain introns, all mouse tyrosine (Tyr)- 
tRNA genes contain an intronic sequence. Notably, in Clp1‘”“ MEFs 
we detected accumulation of ~41-46-nucleotide Tyr-tRNA fragments 
using a northern probe against the 5’ exon (Fig. 4d). RNA sequencing 
revealed that this fragment comprises a 5’ leader followed by 5’ exon 
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Tyr-tRNA sequences (Fig. 4e and Supplementary Fig. 21a, b). Enzy- 
matic treatments uncovered a 5’ triphosphate modification (Sup- 
plementary Fig. 21c), indicating that this novel tRNA fragment con- 
tains a full-length 5’ leader sequence starting with the transcription 
initiator PPP-nucleotide. RNAi-mediated silencing of TSEN2 in MEFs 
resulted in decreased levels of tRNA fragments (Fig. 4f and Sup- 
plementary Fig. 21d). Bioinformatic and northern blotting analyses 
revealed minor accumulation of such 5’ fragments from other intron- 
containing tRNAs, mostly arginine tRNAs (Supplementary Fig. 22). 
A similar accumulation of tyrosine tRNA fragments was observed 
in the spinal cord of C57BL/6 Clp1‘* neonates and 4-month-old 
Clp1“ mice on a CBA/J background (Fig. 4d). Moreover, we 
observed increased levels of these tRNA fragments in cortex, muscle, 
heart, kidney, muscle and liver (Supplementary Fig. 23a). Steady-state 
levels of mature tRNAs seemed to be normal in MEFs, the spinal cord 
of C57BL/6 Clp1*“* neonates and in postnatal CBA/J Clp1“ mice 
(Supplementary Fig. 23b). Thus, loss of CLP1 kinase activity results in 
the accumulation of novel RNA fragments derived from pre-tRNA. 


Oxidative-stress-induced cell death 


Other tRNA fragments, for example, tRNA-derived, stress-induced 
small RNAs (tiRNAs), are generated in the cytoplasm by the endo- 
nuclease angiogenin acting on mature tRNAs'®"”. Similar to tiRNAs, 
Tyr-tRNA fragments were also induced by H,O;, but were not gen- 
erated by angiogenin. Rather, Tyr-tRNA fragments were present in 
the nucleus and were derived from pre-tRNAs actively transcribed 
by RNA Pol III (Supplementary Fig. 24a-d). Because Tyr-tRNA 
fragments were strongly induced by H20;, we proposed that CLP1 
might have a role in the oxidative stress response. In wild-type MEFs, 
the Tyr-tRNA fragments also accumulated after exposure to the 
reactive oxygen species (ROS) inducers glucose oxidase, menadione 
and paraquat dichloride, but not nefazodone (Supplementary Fig. 
25a-d). Treatment with the mitochondrial ‘poisons’ carbonyl cyanide 
4-(trifluoromethoxy) phenylhydrazone (FCCP) and rotenone, the 
apoptosis inducer staurosporine, the protein-synthesis inhibitor pur- 
omycin, and the DNA-damaging agent camptothecin did not trigger 
the accumulation of Tyr-tRNA fragments (Supplementary Fig. 25e, f). 
Importantly, on H,O, and glucose oxidase challenge, we observed 
increased death of Clp1“’“ MEFs; re-expression of wild-type CLP1 
restored the survival rate to that of control MEFs (Fig. 5a and 
Supplementary Fig. 26a). 

Extending our studies to neurons, we trans-differentiated wild- 
type and Clp1‘’“ MEFs into Hb9-GFP* motor neurons!’, and 
observed typical sodium and potassium currents, as well as normal 
action potentials and responses to excitatory and inhibitory transmit- 
ters (Fig. 5b and Supplementary Fig. 26b-f). However, the resting 
membrane potential of Clp1‘““ motor neurons was depolarized by 
over 10 mV relative to control motor neurons (—57.0 + 2.0 mV for 
n= 11 Clp1*’* and —46.9 + 3.0mV for n = 10 Clp1“* motor neu- 
rons; t-test <0.01), suggesting that Clp1‘’* motor neurons exhibit 
ion-exchange abnormalities. Cell input resistance was similar 
between the two groups of neurons (367+ 94MQ for n=11 
Clp1*’* and 511 + 63 MQ for n = 10 clp 1" motor neurons; t-test 
>0.2). Importantly, similar to MEFs, trans-differentiated Cipr™ af 
motor neurons were more sensitive to H,O,-induced cell death than 
wild-type motor neurons (Fig. 5c). Thus, loss of the catalytic activity 
of CLP1 results in enhanced motor-neuron death in response to oxid- 
ative stress. 


Motor-neuron loss is mediated by p53 

Oxidative stress has been linked to a p53-regulated cell death pathway 
through serine-18 phosphorylation, which has been shown to regulate 
p53-dependent transcription” *". In response to HO», Clp1’“ MEFs 
showed hyperphosphorylation of p53 at serine 18 and increased 
induction of the p53-target gene p21 (also known as CDKN1A); re- 
expression of wild-type CLP1 rescued the H,O-induced serine-18 
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Figure 5 | Neonatal lethality and motor-neuron loss are mediated by p53. 
a, Viability of Clp1*’*, Clp1* and Clp1** 3T3 MEEFs rescued with Flag- 
tagged wild-type CLP1. Cells were exposed in triplicate to the indicated 
concentrations of H,O, for 1h, and mean viability was determined 12h later. 
Data represent the means + s.d. of three independent experiments. **P < 0.01, 
*P < 0,001 (analysis of variance (ANOVA)). b, Representative images of 
Hb9-GFP* trans-differentiated Clp1*’* and Clp1* motor neurons. Scale 
bar: 20 um. c, Wild-type and Clp1** trans-differentiated motor neurons 
(iMNs) were treated in duplicate cultures with 100 uM HO, or water (control) 
for 3h. Mean survival + s.d. was scored after 48 h. *P < 0.05 (paired t-test). 
d, p53 serine-18 phosphorylation (p) in H,O,-treated (100 1M for 1 h) Clp1** 
and Clp1*’* primary MEFs. Numbers indicate hours of recovery after 
stimulation. Time point 0 indicates no stimulation. B-Actin levels served as 
loading control. e, Mean viability + s.d. of p53" Clp1*’* and p53‘ Clp1*’* 
MEFs challenged in triplicate with the indicated concentrations of HzO, for 1h. 
Cell viability was determined 12h later. Data represent the means + s.d. of 
three independent experiments. *P < 0.05 (¢-test). f, g, General appearance 
(f) and lung structures (g; haematoxylin and eosin) of newborn p53 /~ Clp1*/* 
and p53‘ Clp1‘* littermates on a C57BL/6 background. Scale bar: 50 jum. 
h, Whole-mount immunostaining of the diaphragms of 3-day-old 
p53-'~Clp1** and p53‘ Clp1™ littermates showing postsynaptic AChR 
clusters (red, «-bungarotoxin) and axons and branched presynaptic terminals 
of the phrenic nerve (green; neurofilament/synaptophysin immunostaining). 
Scale bar: 250 jim; inset: 25 jim. i, Whole-mount immunostaining of 
diaphragms of E18.5 p53‘ Clp1*’~ and p53 ‘~ Clp1*“* littermates showing 
postsynaptic AChR clusters (red; «-bungarotoxin) and Schwann cells (green; 
S100). Scale bar: 75 um. 


hyperphosphorylation (Fig. 5d and Supplementary Fig. 27a, b). 
Similarly, we found p53 serine-18 hyperphosphorylation in Clp1‘“ 
MEFs in response to the ROS-inducing agent glucose oxidase but not 
the DNA-damaging agent camptothecin (Supplementary Fig. 27c, d). 
Most importantly, genetic inactivation of p53 rescued Clp1“”“ MEFs 
from enhanced death after H,O, treatment (Fig. 5e). Overexpression 
of two different tyrosine tRNA fragments in the mouse motor neuron 
cell line NSC-34 also resulted in enhanced p53 activation in response 
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to HO, (Supplementary Fig. 28a, b). Thus, kinase-dead CLP1 renders 
MEFs more susceptible to oxidative-stress-induced cell death through 
a p53-regulated pathway. 

Notably, we observed a complete rescue of neonatal lethality 
in p53’ Clp1“* mice on the 100% lethal C57BL/6 background 
(Fig. 5f). The viable p53’ Clp1““ pups showed normal extension 
of the lungs (Fig. 5g), normal innervation of the diaphragm, and 
rescue of NMJ formation (Fig. 5h, i). The numbers of ChAT- 
(Supplementary Fig. 29a, b) and SMI-32-labelled (data not shown) 
motor neurons in the spinal cord were comparable to that of newborn 
and 1-month-old control mice. Furthermore, 1-month-old p53’ a 
Clp1‘“ mice did not show muscle weakness (Supplementary Fig. 
29c-e). Older p53’ Clp1** mice could not be assessed because they 
developed tumours due to the loss of p53. To test whether oxidative 
stress is involved in the phenotype, we treated pregnant Clp1‘”* females 
(crossed to Clp1‘’* males on the lethal C57BL/6 background) with the 
ROS scavenger N-acetylcysteine (NAC). NAC treatment resulted 
in viable Clp1‘“* pups and partially restored innervation of the dia- 
phragm (Supplementary Fig. 30). However, pups from NAC-treated 
mothers died within a week after birth, probably because in vivo ROS 
scavenging by NAC is incomplete and/or other pathways contribute to 
the phenotype. Thus, ROS and p53 constitute critical in vivo pathways 
that mediate motor neuronal loss and neonatal death of Clp1‘”*- 
mutant mice. 


Transgenic rescue of motor-neuron defects 


To provide definitive proof that the CLP1 kinase-dead mutation acts 
in motor neurons, we introduced a Flag-tagged wild-type CLP 1-IRES- 
eGFP transgene under the control of the motor-neuron promoter 
Hb9 (ref. 22) into control and Clp1“* mice on a C57BL/6 background 
(Fig. 6a and Supplementary Fig. 31a, b). Transgenic expression of 
wild-type CLP1 completely rescued neonatal lethality of Clp1‘“* mice 
and restored normal numbers of ChAT* motor neurons in the spinal 
cord of newborns (Fig. 6b, c and Supplementary Fig. 31c). Moreover, 
Hb9-CLP1 transgenic Clp1““ mice did not accumulate Tyr-tRNA 
fragments in the spinal cord (Fig. 6d), providing direct evidence that 
accumulation of these tRNA fragments is dependent on the loss of 
CLP1 activity. Phrenic nerve defasciculation and mislocalization, and 
the defects in NMJ formation and innervation of the diaphragm in 
Clp1** mice were also rescued by transgenic expression of wild-type 
CLP1 in Clp1** mice; similar results were obtained for intercostal 
muscles (Fig. 6e and Supplementary Fig. 32a-c). Moreover, $100* 
Schwann cells were restored (Fig. 6f), indicating that the observed 
absence of terminal Schwann cells is probably due to the impaired 
function of CLP1 in motor neurons. 

Specific rescue of CLP1 expression in motor neurons but not other 
cell types led us to test whether kinase-dead CLP1 might affect general 
metabolism, which could then contribute to the observed phenotype. 
Using calorimetric experiments, Clp1‘““ mice did not exhibit any 
apparent alterations in food and water intake, O, consumption, CO, 
production, respiratory exchange rate or heat generation as compared 
to the Clp1*’* transgenic littermates; moreover, heat generation upon 
cold exposure and recovery of body temperature, as a measure for 
sympathetic nerve activity”, were comparable (Supplementary Fig. 
33a-h). These transgenic rescue experiments provide direct evidence 
that CLP1 acts in lower motor neurons and that mutant-CLP1- 
mediated motor defects are responsible for the neonatal lethality 


phenotype. 


Conclusions 


Our results provide the first report on the in vivo function of the RNA 
kinase CLP1. CLP1 kinase-dead mice develop progressive loss of 
spinal motor neurons, leading to muscle denervation and para- 
lysis. Inactivation of CLP1 kinase activity results at the same time in 
poor generation of tRNA exon halves and accumulation of novel, 
hitherto undescribed 5’ leader exon tRNA fragments. This paradoxical 
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Figure 6 | CLP1 acts in motor neurons. a, Cryosections of lumbar (L5) spinal 
cord from E14.5 Clp1*/* and Clp1*’* Hb9-CLP1 transgenic (Clp1*’~ Tg) 
embryos. Transgene expression was localized by eGFP and anti-GFP 
antibodies. DAPI counterstaining is shown. b, Appearance of 1-month-old 
Clp1'’* Clp1* transgenic littermates on a C57BL/6 background. 

c, Immunostaining for ChAT™ (red) motor neurons in the lumbar (L5) spinal 
cord of newborn Clp1‘’*, Clp1*’* transgenic, Clp1* and Clp1* transgenic 
mice on a C57BL/6 background. DAPI staining. Right panel shows mean 
numbers + s.d. of ChAT” motor neurons. n = 5 mice per group. ***P < 0.001 
(ANOVA). d, Levels of Tyr-tRNA fragments in the spinal cord of Clp1 ple 
Clp1‘’* transgenic, Clp1** and Clp1* transgenic mice. Northern blot analyses 
of total RNA using a probe complementary to the 5’ exon of Tyr-tRNA. U6 RNA 
served as a loading control. e, Whole-mount immunostaining depicting NMJs in 
diaphragms of 3-day-old Clp1*’~ transgenic and Clp1*“* transgenic littermates. 
Postsynaptic AChR clusters (red; «-bungarotoxin) and innervating motor axons 
and presynaptic nerve terminals (green; neurofilament/synaptophysin 
immunostaining) are shown. f, Whole-mount immunostaining of the 
diaphragms of 3-day-old Clp1*’* transgenic, Clp1 and Clp1* transgenic 
littermates showing postsynaptic AChR clusters (red; o-bungarotoxin) and 
$100* Schwann cells (green). Scale bars: a, ¢, e, 50 Lum; f, 100 um. 


observation could be explained by a defect in tRNA exon ligation 
in a CLP1 kinase-dead background regulated by oxidative stress. 
Different tRNA fragments have been previously reported in a variety 
of organisms**”*. For example, the cytoplasmatic pre-tRNA-derived 
tRF-1001 fragment, a 3’ trailer molecule, seems to have a role in cell 
viability*®. Stress-induced tiRNAs, derived from the processing of 
mature tRNAs in the cytoplasm", inhibit initiation of protein trans- 
lation by displacing the e[F4F complex from mRNAs". We did not 
observe translation inhibition by our novel Tyr-tRNA fragments in 
metabolic labelling experiments; rather, we found that such fragments 
sensitize cells to oxidative-stress-induced activation of the p53 tumour 
suppressor pathway. The exact molecular mechanism(s) by which 
these tRNA fragments couple to the p53 pathway need to be deter- 
mined. Importantly, motor-neuron loss was rescued in vivo either by 
reducing oxidative stress or by genetically inactivating p53. 


ARTICLE 


Our experiments uncover an unexpected mechanistic link between 
tRNA processing, formation of a new RNA species, and a p53-regu- 
lated progressive loss of lower motor neurons. These results provide a 
conceptual and experimental framework for how alterations in tRNA 
metabolism can affect spinal motor neurons, and might help explain 
fundamental molecular principles in diseases such as amyotrophic 
lateral sclerosis or spinal muscular atrophy. 


METHODS SUMMARY 


Generation of kinase-dead Clp1’““ and Hb9-CLP1 transgenic mice. Complete 
mutant and knock-in Clp1‘”* mice were generated by homologous recombina- 
tion. The Clp1* allele was backcrossed five times to C57BL/6 or CBA/J mice. 
p53-deficient mice and Hb9-GFP transgenic mice were obtained from The 
Jackson Laboratory. Hb9-CLP1 transgenic mice containing the Hb9 promoter 
driving Flag-tagged mouse CLP1 and eGFP were generated in house. All mice 
were maintained according to institutional guidelines. 

Phenotyping. Behavioural phenotyping and assessment of motor functions were 
performed as described in Supplementary Information. Diaphragm and inter- 
costals muscles were stained as described previously’. Anti-neurofilament and 
anti-synaptophysin antibodies were used to label axons and presynaptic nerve 
terminals, respectively. Antibodies to S100 were used to visualize Schwann cells. 
Postsynaptic AChRs were detected with Alexa-594-conjugated «-bungarotoxin. 
Spinal motor neurons were stained with anti-ChAT and anti-SMI-32 antibodies. 
MEFs and trans-differentiated motor neurons were generated as described”. 
Pre-tRNA splicing. Pre-tRNA substrates were generated by in vitro transcrip- 
tion. Labelled yeast pre-tRNA?" or human pre-tRNA?” were incubated with cell 
extracts or TEV eluates, and formation of mature tRNA and/or tRNA exons was 
monitored by phosphorimaging. 

Northern blot analysis. For tRNA detection, total RNA from tissues and cultured 
cells was isolated, subjected to gel electrophoresis and blotted on Hybond-N* 
membranes. Blots were hybridized using [5’ *’P]-labelled DNA/LNA probes to 
detect Tyr-tRNA 5’ and 3’ exons. 

Identification of Tyr-tRNA fragments. Total RNA from primary MEFs was 
separated by gel electrophoresis. The region containing RNA of 37-50 nucleo- 
tides in size was excised and sequenced using an Illumina platform. Reads were 
aligned to the mouse genome (http://gtrnadb.ucsc.edu/Mmusc/Mmusc-by- 
locus-txt.html) using bedtools (v. 2.16.2). 


Received 6 March 2012; accepted 18 January 2013. 
Published online 10 March 2013. 


1. Weitzer, S.& Martinez, J. The human RNA kinase hClp1 is active on 3’ transfer RNA 
exons and short interfering RNAs. Nature 447, 222-226 (2007). 

2. Ramirez, A. Shuman, S. & Schwer, B. Human RNA 5’-kinase (hClp1) can function 
as a tRNA splicing enzyme in vivo. RNA 14, 1737-1745 (2008). 

3. Jain, R.& Shuman, S. Characterization of a thermostable archaeal polynucleotide 
kinase homologous to human Clp1. RNA 15, 923-931 (2009). 

4. deVries, H. eta/. Human pre-mRNA cleavage factor Il, contains homologs of yeast 
proteins and bridges two other cleavage factors. EMBO J. 19, 5895-5904 (2000). 

5. Minvielle-Sebastia, L., Preker, P. J., Wiederkehr, T., Strahm, Y. & Keller, W. The major 
yeast poly(A)-binding protein is associated with cleavage factor IA and functions in 
premessenger RNA 3’-end formation. Proc. Nat! Acad. Sci. USA 94, 7897-7902 
(1997). 

6. Holbein, S. et al. The P-loop domain of yeast Clp1 mediates interactions between 
CF l|Aand CPF factors in pre-mRNA 3’ end formation. PLoS ONE 6, 629139 (2011). 

7. Haddad, R. et al. An essential role for Clp1 in assembly of polyadenylation complex 
CF lAand Pol II transcription termination. Nucleic Acids Res. 40, 1226-1239 (2012). 

8. Ghazy, M.A. et al. The interaction of Pcfl1 and Clp1 is needed for mRNA 3’-end 
formation and is modulated by amino acids in the ATP-binding site. Nucleic Acids 
Res. 40, 1214-1225 (2012). 

9. Paushkin, S. V., Patel, M., Furia, B. S., Peltz, S. W. & Trotta, C. R. Identification of a 
human endonuclease complex reveals a link between tRNA splicing and pre- 
mRNA 3’ end formation. Ce// 117, 311-321 (2004). 

10. Trotta, C.R. etal. The yeast tRNA splicing endonuclease: a tetrameric enzyme with 
two active site subunits homologous to the archaeal tRNA endonucleases. Cel! 89, 
849-858 (1997). 


. Zillmann, M., Gorovsky, M. A. & Phizicky, E. M. Cons 


splicing in eukaryotes. Mol. Cell. Biol. 11, 5410-54 


erved mechanism of tRNA 
6 (1991). 


. Zhao, C. et al. Charcot-Marie- 


ooth disease type 2A caused by mutation ina 


microtubule motor KIF1Bf. Cell 105, 587-597 (2001). 

13. Sanchez-Carbente, M. R., Castro-Obregon, S., Covarrubias, L. & Narvaez, V. 
Motoneuronal death during spinal cord development is mediated by oxidative 
stress. Cell Death Differ. 12, 279-291 (2005). 

14. Medana, I. M. & Esiri, M. M. Axonal damage: a key predictor of outcome in human 
CNS diseases. Brain 126, 515-530 (2003). 

15. Atkin, J.D. etal. Properties of slow- and fast-twitch muscle fibres in a mouse model 
of amyotrophic lateral sclerosis. Neuromuscul. Disord. 15, 377-388 (2005). 


28 MARCH 2013 | VOL 495 | NATURE | 479 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


16. Yamasaki, S., lvanov, P., Hu, G. F. & Anderson, P. Angiogenin cleaves tRNA 
and promotes stress-induced translational repression. J. Cell Biol. 185, 35-42 
(2009). 

17. lvanov, P., Emara, M. M., Villen, J., Gygi, S. P. & Anderson, P. Angiogenin-induced 
tRNA fragments inhibit translation initiation. Mol. Cell 43, 613-623 (2011). 

18. Son, E. Y. et al. Conversion of mouse and human fibroblasts into functional spinal 
motor neurons. Cell Stem Cell 9, 205-218 (2011). 

19. Lambert, P. F., Kashanchi, F., Radonovich, M. F., Shiekhattar, R. & Brady, J. N. 
Phosphorylation of p53 serine 15 increases interaction with CBP. J. Biol. Chem. 
273, 33048-33053 (1998). 

20. Dumaz, N. & Meek, D. W. Serine 15 phosphorylation stimulates p53 
transactivation but does not directly influence interaction with HDM2. EMBOJ. 18, 
7002-7010 (1999). 

21. Chao, C. et al. Cell type- and promoter-specific roles of Ser18 phosphorylation in 
regulating p53 responses. J. Biol. Chem. 278, 41028-41033 (2003). 

22. Arber, S. et al. Requirement for the homeobox gene Hb9 in the consolidation of 
motor neuron identity. Neuron 23, 659-674 (1999). 

23. Hanada, R. eta/. Neuromedin U has a novel anorexigenic effect independent of the 
leptin signaling pathway. Nature Med. 10, 1067-1073 (2004). 

24. Tuck, A.C. & Tollervey, D. RNA in pieces. Trends Genet. 27, 422-432 (2011). 

25. Hurto, R.L. Unexpected functions of tRNA and tRNA processing enzymes. Adv. Exp. 
Med. Biol. 722, 137-155 (2011). 

26. Lee, Y. S., Shibata, Y., Malhotra, A. & Dutta, A. A novel class of small RNAs: tRNA- 
derived RNA fragments (tRFs). Genes Dev. 23, 2639-2649 (2009). 

27. Herbst, R,, Iskratsch, T., Unger, E. & Bittner, R. E. Aberrant development of 
neuromuscular junctions in glycosylation-defective Large”? mice. Neuromuscul. 
Disord. 19, 366-378 (2009). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank A. Meixner, M. Foong, T. Nakashima, H. C. Theussl, 
J. R. Wojciechowski, A. Bichl, the mouse pathology unit of the Universitatsklinikum 


480 | NATURE | VOL 495 | 28 MARCH 2013 


Hamburg-Eppendorf, and G. P. Resch for discussions and technical support. We also 
thank T. Buerckstuemmer for providing the pRV-NTAP vector. J.M.P. is supported by 
grants from the Institute of Molecular Biotechnology of the Austrian Academy of 
Sciences (IMBA), the Austrian Ministry of Sciences, the Austrian Academy of Sciences, 
AustroMouse network of Genome Research in Austria (GEN-AU), Apoptosis systems 
biology applied to cancer and AIDS (ApoSys) and a European Research Council 
Advanced Grant from the European Union. J.M., S.W. and B.M. are supported by IMBA 
and GEN-AU (AustroMouse). T.H. is supported by the Japan Society for the Promotion 
of Science and the Astellas Foundation. J.K.|. was supported by National Institutes of 
Health (NIH) grant K99NSO077435-01A1.M.G. is supported by grants from the German 
Research Foundation (DFG) (FG885 and GRK 1459), the Landesexzellenzinitiative 
Hamburg (Neurodapt). R.H. was supported by the Austrian Science Fund (P19223, 
P21667). CJ.W. is supported by the NIH (NSO38253). 


Author Contributions T.H. generated mutant mice, performed mouse phenotyping 
and developed cell lines with help from R.H. S.W. and B.M. performed all biochemical 
assays and northern blots. V.K. performed histological analysis. F.S.,H.M.,S.J.C.andAY. 
provided key reagents and technical help for generation of mutant mice. |.T. analysed 
tRNA fragment distributions. M.O. performed calorimetric experiments. B.J.W., JI, 
K.C.E. and C.J.W. performed and helped with transdifferentiation experiments and 
patch clamping. R.H. performed immunostainings and assessment of NMJs and 
motor-neuron pathfinding. A.M. and A-Y. performed DRG explant cultures, and J.R. and 
R.K. carried out exon arrays. C.B. and M.G. performed histological and 
immunohistochemical staining of, and assessed, peripheral nerve and brain 
structures. J.M. and J.M.P. coordinated the project and wrote the manuscript. 


Author Information Data have been deposited in the Gene Expression Omnibus under 
accession numbers GSE35924 and GSE39275. Reprints and permissions information 
is available at www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to J.M. 
(javier.martinez@imba.oeaw.ac.at) or J.M.P. (josef.penninger@imba.oeaw.ac.at). 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature11991 


Structural visualization of key steps in 
human transcription initiation 


Yuan Hel, Jie Fang’, Dylan J. Taatjes* & Eva Nogales*"* 


Eukaryotic transcription initiation requires the assembly of general transcription factors into a pre-initiation complex that 
ensures the accurate loading of RNA polymerase II (Pol II) at the transcription start site. The molecular mechanism and 
function of this assembly have remained elusive due to lack of structural information. Here we have used an in vitro 
reconstituted system to study the stepwise assembly of human TBP, TFIIA, TFIIB, Pol II, TFIIF, TFITE and TFIIH onto 
promoter DNA using cryo-electron microscopy. Our structural analyses provide pseudo-atomic models at various 
stages of transcription initiation that illuminate critical molecular interactions, including how TFIIF engages Pol II and 
promoter DNA to stabilize both the closed pre-initiation complex and the open-promoter complex, and to regulate 
start-site selection. Comparison of open versus closed pre-initiation complexes, combined with the localization of the 
TFIIH helicases XPD and XPB, support a DNA translocation model of XPB and explain its essential role in promoter opening. 


Accurate and regulated initiation of eukaryotic gene transcription 
represents a major step in gene regulation, requiring the coordinated 
activity of a large number of proteins and protein complexes. The basal 
transcriptional machinery includes RNA polymerase II (Pol II) along 
with a series of general transcription factors (GIFs) (TFIIA, TFIIB, 
TFIID, TFIE, TFIIF and TFIIH) that assemble into a ~2-megadalton 
(MDa) complex on core promoter DNA. This pre-initiation complex 
(PIC) is essential to direct accurate transcription start site (TSS) selec- 
tion, promoter melting and Pol II promoter escape’. Despite recent 
structural advances on Pol II** and subcomplexes of the PIC®, the 
molecular assembly details of this essential complex remain elusive. 

In vitro reconstitution of this process has provided a model for the 
sequential assembly pathway of transcription initiation. TFIID is the 
first factor specifically recruited to the promoter. This megadalton 
complex includes the TATA-binding protein (TBP), which is suffi- 
cient for basal transcription on TATA-box-containing promoters”*”. 
TFIA and TFIIB are then recruited, further stabilizing the interaction 
between TBP and promoter DNA. Next, Pol II, probably in asso- 
ciation with TFIIF, adds to the growing PIC. Finally, TFIIE and 
TFHH, which is required for DNA melting, are recruited to form 
the transcriptionally competent PIC”. 

Structural characterization of PIC assemblies is challenging and 
has been limited to a small number of electron microscopy (EM) 
studies*"'°. Crystallographic structures of individual components, 
combined with biochemical data, have led to a number of structural 
models for PIC subcomplexes, in either a closed or open-promoter 
conformation®’""'*, In spite of this progress, important questions 
remain unanswered, such as how TFIIB and TFIIF serve complemen- 
tary roles during the promoter opening process or how TFIIE posi- 
tions TFIH in a configuration capable of melting the DNA. 

Here we present cryo-EM snapshots of PIC intermediates during 
sequential assembly (see Supplementary Video 1). A reconstitution 
system allowed us to localize each GTF within the cryo-EM structures, 
track the effect of each additional factor on the PIC, and ultimately 
reveal the network of protein-protein and protein-DNA interactions 
governing PIC assembly. Furthermore, by visualization of an open- 
promoter complex (OC) mimic, we have obtained new mechanistic 


details concerning promoter melting. Together, our structures pro- 
vide unprecedented insights into the molecular assembly, organiza- 
tion and functional roles of different GIFs during transcription 
initiation. 


Stepwise assembly and visualization of human PIC 

To structurally characterize the sequential assembly of GTFs neces- 
sary for human transcription initiation, we developed an in vitro 
system for reconstitution and purification of a simplified PIC, in 
which TBP substituted for TFIID, and that ultimately contained 31 
polypeptides. Our promoter DNA contained TATA, BREu and BREd 
(upstream and downstream TFIIB recognition elements, respectively), 
and INR (initiator element) core promoter elements and was immo- 
bilized on streptavidin beads (Fig. 1a). After stepwise assembly of PIC 
intermediates by sequential incubation with the desired GTFs, stable 
complexes were released by restriction enzyme digestion. The effec- 
tiveness of this approach for structural characterization of the PIC 
intermediates was initially tested by single-particle EM of negatively 
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Figure 1 | Stepwise assembly of the human PIC. a, Reconstitution strategy 
for human PIC by sequential assembly. Schematic of the DNA highlighting the 
relative positions of the core promoter elements used and Sall restriction site 
(top). Colour scheme for the components of the PIC is shown at the bottom. 
b-e, Negative stain reconstructions of PIC assembly intermediates for TBP- 
TFIIA-TFIB-DNA-Pol II (b), plus TFIIF (c), plus TFIIE (d), and plus TFITH (e). 
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stained samples (Supplementary Fig. 1). This initial analysis allowed 
us to localize each GTF within the context of the full assembly (Fig. 1b-e), 
although it precluded the visualization of DNA. The stepwise purification 
approach enabled us to describe the effect of factor addition on the rest 
of the PIC, which cannot be achieved by studying individual factors or 
the complete PIC. The negative stain structures were then used as 
starting references to generate cryo-EM reconstructions of the PIC 
subcomplexes with improved resolution that allowed visualization 
of the DNA and accurate docking of existing crystal structures (Sup- 
plementary Figs 2-5). 


Recruitment of Pol II onto promoter DNA 


To start, we obtained the cryo-EM structure of a PIC subcomplex 
containing TBP, TFIIA, TFIIB, Pol II and core promoter DNA 
(Fig. 2a). Crystal structures of TBP-TFIIA-DNA”, TBP-TFIIB- 
DNA“ and yeast Pol II-TFIIB''’*'> could be unambiguously docked 
into our density map as rigid bodies (Supplementary Fig. 2e). This 
procedure validated our cryo-EM structure while also allowing the 
localization of each factor to generate a pseudo-atomic model of the 
assembly. The visible portion of DNA accounts only for the upstream 
core promoter elements, which are stabilized by protein-DNA inter- 
actions, whereas the DNA downstream of the BREd lacks contact with 
the PIC and was not visualized due to its flexibility (Fig. 2a). 

A yeast PIC model has been proposed previously, based on super- 
imposing crystal structures using the common protein as an anchor 
point’*. Our data show that a simple pivoting of the carboxy-terminal 
cyclin fold domain of TFIIB around the amino-terminal one can 
explain the position of the TBP/TFIIA module in our map using the 
available crystal structures, without disrupting the interaction between 
the N-terminal cyclin fold of TFIIB and Pol II (Supplementary Fig. 7). 
This small discrepancy with the previous piece-wise model is unlikely 
to be due to differences between the human and yeast systems, but 
rather a re-organization with respect to individual crystal structures 
upon interaction of GTFs on the core promoter. 


Effect of TFIIF on PIC assembly 


According to the sequential assembly pathway, TFIIF is recruited to 
the promoter in association with Pol II’. To understand its structural 
role during PIC assembly, we added TFIIF separately to our recon- 
stituted system. By comparing the cryo-EM structures of PIC sub- 
complexes in the absence and presence of TFIIF, we identified 
additional protein densities appearing at two nearby locations, by 
the lobe and protrusion domains of Pol II (Fig. 2b). Importantly, 
the addition of TFIIF also resulted in the stabilization of the down- 
stream DNA along the cleft of Pol II, in a position that is distinct from 
a previously proposed model’? (Supplementary Fig. 8). Thus, TFIIF is 
required for the engagement of DNA by Pol II within the context of a 
closed PIC. 

The crystal structure of the human TFIIF dimerization domain"® 
could be unambiguously fitted into the new density ascribed to TFIIF 
by the lobe domain of Pol II using rigid-body docking, only slightly 
shifted from previous models that were based on crosslinking data’”"* 
(Fig. 2c and Supplementary Fig. 3e). No obvious density was observed 
for the arm domain of RAP74, which extends about 45 A from the end 
of the RAP74 barrel’’, indicating that it is mobile at this stage of PIC 
assembly. A small clash between the RPB2 lobe and the RAP74 «1 
helix can be explained by a reorganization of this element in the 
context of the PIC (see Supplementary Fig. 9 for details). 

In addition to interacting with Pol II, our cryo-EM structure indi- 
cates that a region of TFIIF contacts the BREd directly (Fig. 2c, d). We 
propose that this region corresponds to the C-terminal winged-helix 
(WH) domain of RAP30, based upon the following: (1) the size of this 
additional TFIIF density is consistent with the RAP30 WH domain; 
(2) RAP30 has been shown to crosslink to BREd’’ and its C-terminal 
WH domain has been identified to be in direct contact with the 
protrusion domain of Pol II’’; (3) RAP30, rather than RAP74, is 
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Figure 2 | TFIIF engagement triggers a concerted conformational change in 
the PIC. a,b, The positions of TBP, TFIIA, TFIIB and DNA promoter elements 
relative to Pol II are indicated in the cryo-EM reconstruction in the absence 
(a) or presence of TFIIF (b). TFIIF stabilizes promoter DNA (cyan). ¢, Crystal 
structures for TBP-TFIIA-DNA (Protein Data Bank (PDB) accession INVP), 
TBP-TFIIB-DNA (PDB 1C9B), Pol II-TFIIB (PDB 4BBR), RAP30/74 
dimerization domain (PDB 1F3U'*), RAP30 WH domain (PDB 1BBY“’), and 
modelled B-form DNA (globally bent between —23 and +7 by 18°) are shown 
docked into transparent EM densities. The mobile clamp of Pol II is docked as a 
separate domain. d, Bottom view showing the presence of a nucleoprotein 
complex by the upstream core promoter elements. The DNA densities have 
been segmented out for clarity. Perturbed residues within the RAP30 WH 
domain during DNA titrations are coloured in gold’. The possible path for the 
RAP30 N terminus is highlighted with dotted purple lines. DNA is shown in 
ribbon representation. e, Position of moving structural elements before (grey) 
and after (coloured) TFIIF binding. The rest of the PIC components are shown 
in transparency and major structural rearrangements are depicted by arrows. 


required for accurate transcription initiation” and deletion mutants 
of RAP30’s WH domain are lethal in yeast'*’’. This WH domain 
therefore contributes to a unique nucleoprotein complex formed by 
TBP, TFIIA and the TFIIB cyclin fold, as they contact the core pro- 
moter elements upstream of the INR, which is further stabilized by 
the protrusion domain of Pol II (Fig. 2d and Supplementary Fig. 10). 

Comparison of structures shown in Fig. 2a and b strongly suggests 
that the overall effect of TFIIF on the assembling PIC is a clear stabi- 
lization of the DNA along the Pol II cleft. Given its location, we 
propose that the RAP30 WH domain has an essential role in position- 
ing the flexible promoter DNA downstream of BREd along the Pol II 
cleft, thus facilitating subsequent steps in the promoter melting 
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process. However, a contribution to this DNA stabilization by the 
dimerization domain is also possible. Correct positioning of the 
DNA by TFIIF is consistent with its role in promoter opening and 
TSS selection***. Our structures also revealed the opening of the Pol 
II clamp domain as it accommodates the downstream DNA (Fig. 2e 
and Supplementary Fig. 11). In addition, we observe further rotation 
of the TBP-TFIIA-TFIIB subcomplex with respect to previously 
proposed models, positioning it even closer to Pol II (Fig. 2e and 
Supplementary Fig. 7). 


TFIIE-containing PIC and DNA contacts 

The addition of TFIIE to the growing PIC resulted in new protein 
density that connects TFIIF with the stalk domain of Pol II (Fig. 3a). 
The resolution of this reconstruction (11 A) was the highest obtained 
for any of the complexes studied, indicating that TFIIE stabilizes 
the PIC. The density corresponding to TFIIE, however, was the least 
well-defined element according to local resolution calculations (Sup- 
plementary Fig. 4f), which may be due to flexible connections between 
the WH domains predicted within the TFIIE structure® (Fig. 3a, b). 
One end of TFIIE associates with the stalk of Pol II by interacting with 
the RPB7 L45 loop (Fig. 3c), which has been predicted to stabilize the 
OC and the deletion of which completely abolished transcription”’. 
Also consistent with the positioning of TFIIE in our PIC structure, 
a zinc ribbon domain within the archaeal homologue of TFIIE 
was found to be located near the base of the stalk domain of the 
polymerase*®. Away from the stalk, the TFIIE density contacts the 
Pol II clamp domain to interact ultimately with the WH domain of 
TFHF. A model of the three WH domains within TFIE interact- 
ing with elements of the clamp head has been proposed based on 
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Figure 3 | Stabilization of the PIC in the closed conformation by TFIIE. 

a, Segmentation of the cryo-EM reconstruction of human PIC containing 
TFIE (left), and docking of existing crystal structures (right). b, Same view as 
the right panel in a depicting the cryo-EM density corresponding to TFIIE and 
the RAP30 WH domain. A chain of four WH domains formed by the C 
terminus of RAP30 and both subunits of TFIIE (modelled based on 
crosslinking data®) can be roughly fitted into the cryo-EM density. c, Regions of 
contact between TFIIE and Pol II. The clamp and stalk domain of Pol II 

are shown in grey, TFIE in maroon. The rest of the PIC components are shown 
in transparency. The clamp coiled-coil domain is shown in gold and the 
residues that crosslink to TFIIE” are coloured in navy blue. d, EM density 
corresponding to promoter DNA is shown together with the PIC 

ribbon model. Structural elements making direct contacts with the 

promoter DNA are depicted. 
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crosslinking studies® (Fig. 3c). Although the model cannot fit the 
EM density perfectly, the overall path of the three tandem WH 
domains in the model follows the elongated TFIIE cryo-EM density 
and ends by directly contacting the RAP30 WH domain (Fig. 3b). 
Therefore, a continuous chain of four WH domains appears to link 
the Pol II clamp region with the TBP-TFIIA-TFIIB-DNA subcom- 
plex, preventing DNA from leaving the cleft. 

Our 11A resolution reconstruction of the PIC containing TFIIE 
starts to reveal the major and minor grooves of the promoter DNA 
(Fig. 3d), allowing us to model its path. We found that linear B-form 
DNA could not be accommodated into the DNA density (Supplemen- 
tary Fig. 12), requiring instead a smooth bend of 18° between posi- 
tions —23 and +7 that fitted both the path and groove features 
of the EM density. Interestingly, a hypersensitivity region around 
—6 position” locates at one of the downstream DNA-Pol II interfaces 
as discussed below. 

We observed two protein contacts with the downstream DNA. One 
connection involves the 3-strand B-sheet below the clamp head 
whereas the other is mediated by a 2-helix bundle at the tip of the 
RPBS5 jaw (Fig. 3d). Interestingly, these are the only two positively 
charged protein surfaces on Pol II along the path of the downstream 
DNA (Supplementary Fig. 12). The INR element is sandwiched 
precisely between these two protein-DNA contacts, an arrangement 
that may be relevant in promoter melting at the correct position in 
the DNA. The slightly open clamp conformation seen upon DNA 
placement onto the cleft after TFIIF addition is probably due to the 
interaction of the DNA with the clamp head f-sheet (Fig. 3d and 
Supplementary Fig. 11b, c). 

The spacing between the TATA box and the TSS can vary between 
species, whereas the region within promoter DNA that is melted 
during transcription initiation is ~20 base pairs (bp) downstream 
of TATA”. We inferred the approximate position of flexible ele- 
ments within TFIIB and TFIIF by docking their crystal structures as 
rigid bodies within our cryo-EM density. Importantly, we find that 
both the TFIIB linker helix and the TFIIF arm domain align with the 
promoter melting start site (Supplementary Fig. 13). This arrange- 
ment is consistent both with the proposed role of the linker helix of 
TFIIB in promoter opening’ and with the crosslinking of the arm 
domain of RAP74 to the TFIIB linker near the active site’”*°, as well 
as with the suppression of the TSS defect of TFIIB mutations by a 
mutant within the arm domain of TFIIF*’. In our rigid-body fitting, 
the linker helix of TFIIB overlaps with the DNA in our model, sug- 
gesting a rearrangement of the helix relative to the clamp domain 
at this stage in the PIC assembly. Finally, the tip of the TFIIF arm 
domain contains seven positively charged residues, whereas four 
positively charged residues are present on the side of the TFIIB linker 
helix that faces the DNA (Supplementary Fig. 13). The juxtaposition 
of these domains within the melting start site is consistent with their 
direct role in DNA interactions. 

The structural features of our Pol II-based PIC model are probably 
conserved with Pol I and Pol III, the two other RNA polymerases in 
eukaryotes. A side-by-side comparison of our Pol II-based PIC model 
with a cryo-EM structure of native Pol III agrees with this hypothesis” 
(Supplementary Fig. 14). 


Transition to the open-promoter complex 

It is well established that the PIC remains stably associated during 
transcription initiation until Pol II undergoes promoter escape’. 
Preceding this step, however, Pol II needs to transition into an OC 
in which the melted single-stranded DNA is inserted into the active 
site. To gain structural insight into the transition from a closed to an 
open promoter complex, we generated a ‘functional mimic’ of the PIC 
in its open conformation by modifying the promoter substrate used 
to form the closed PIC (Fig. 4a). We replaced the segment of DNA 
containing the INR element with a 3’-tailed sequence previously used 
to create an arrested transcription state in yeast Pol II’. We matched 
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the arrested position of Pol II on the template exactly to the TSS used 
in our studies, thereby creating a Pol II-nucleic acid complex con- 
taining only ~5 nucleotides at the active site, while still containing 
upstream core promoter elements available for assembling the rest of 
the PIC. We found that TFIIE had a higher affinity for the OC, as 
excess TFIIE had to be used to saturate the closed PIC, but not the 
open state mimic. Interestingly, excess TFIIE was no longer required 
in the context of the closed PIC when TFIIH was also included (see 
below), in agreement with previous studies suggesting cooperative 
binding of TFIIE and TFIIH within the PIC****. 

The reconstruction of the OC mimic resembled that of the PIC in 
the closed conformation, with all the GIFs remaining at identical 
positions (Fig. 4b and Supplementary Figs 4e, 5e and 10b-d). This 
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Figure 4 | Conformational rearrangements of the PIC upon promoter 
opening. a, Nucleic acid scaffold used to generate a mimic of the OC. Filled and 
open circles correspond, respectively, to the core promoter used in Fig. la, and 
to a replacement sequence containing a 3’-tailed sequence previously designed 
to generate an arrested Pol II’*. The schematic indicates, in pink, the position 
that would correspond to the INR. b, Segmentation of the cryo-EM 
reconstruction of the human PIC in the open state (left), and docking of existing 
crystal structures, together with a modelled DNA bubble’? (right). GIFs adopt 
the same architecture as in the closed PIC. c, EM densities of promoter DNA in 
the closed PIC and OC were segmented and superimposed with respect to Pol 
II. Movements of the DNA between the two states are shown by arrows. The 
rotation accompanying translocation of the downstream DNA occurs within 
the plane of the view shown on the left panel (and thus perpendicular to view 
shown on the left). d, Structures before (grey) and after (coloured) promoter 
opening are shown using fitted crystal structures. The Pol II clamp comes down 
over the open bubble, in a conformation similar to that seen for the elongation 
state*’. e, Segmented cryo-EM density showing the now visible arm domain of 
TFIIF connecting the rest of TFIIF with the rudder of RPB1. Crystal structures 
of TFIIB and TFIIF were docked as rigid bodies into the cryo-EM density. 
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finding is consistent with the prevalent hypothesis that the PIC 
assembled at the promoter remains intact until promoter escape’. 
In contrast, the downstream DNA adopted a conformation previously 
observed for elongating Pol II, indicating that the template strand was 
inserted through the positively charged cleft into the active site* 
(Fig. 4b, c). The single-stranded segments are invisible at our resolu- 
tion or not present (non-template strand). As a reasonable model, 
the bubble depicted in Fig. 4 has been derived from a previous model 
based on FRET studies on the yeast system”’. 

When the position and orientation of the downstream DNA is 
compared between the closed PIC and the OC mimic, it is clear that 
there is a change in orientation concomitant with the insertion of the 
downstream DNA into the active site (Fig. 4c), indicating that the DNA 
rotates on a plane as it translates, while maintaining a point of contact 
between the DNA and RPB5 that corresponds to one of the two con- 
tacts present in the closed state (the one downstream of the INR). 

Other than the repositioning of the DNA within the active site, two 
main differences were observed upon comparison of the OC mimic 
and the closed PIC structures. First, the clamp domain in the open 
state moves down to engage the open DNA bubble, adopting the con- 
formation observed in the elongation state” (Fig. 4d and Supplemen- 
tary Fig. 11d). Thus, the clamp domain completes an open to closed 
transition throughout the process of PIC assembly and promoter 
opening (Supplementary Fig. 11), a cycle also reported for the bac- 
terial system**. Second, an additional protein density now extends 
from the bottom of the clamp and connects to the dimerization 
domain of TFIIF (Fig. 4e). Rigid-body fitting of crystal structures 
suggests that this density corresponds to the stabilized rudder of Pol 
II and the arm domain of TFIIF. We propose that these elements 
interact with each other as the clamp closes down over the melted 
DNA. Interestingly, this proposed interaction would prevent re- 
annealing of the melted DNA. The TFIIB linker helix is near this 
position and probably participates in the promoter melting process 
as well. This proposal is consistent with our hypothesis that the flex- 
ible TFIIB linker helix and the TFIIF arm domain act together in 
promoter opening (Supplementary Fig. 13). Thus, our structure and 
pseudo-atomic model provide a possible explanation for the enig- 
matic role of TFIIF in promoter opening and TSS selection”. 


Positioning of XPB for DNA translocation 


To gain insight into the natural promoter opening process carried out 
by TFIIH, we used the same purification strategy used for the previous 
closed PIC subcomplexes but included the purified, endogenous 10- 
subunit human TFIIH complex as a last step before elution. Given the 
scarcity of purified human TFIIH, this study was limited to negative 
stained samples, which require less material. The three-dimensional 
reconstruction of the TFI[H-containing PIC showed a substantial 
additional density extending away from Pol II, consistent with the 
large molecular mass of TFIIH (0.45 MDa) (Figs le and 5a and Sup- 
plementary Fig. 6e). Surprisingly, only two contacts are observed 
between TFIIH and the rest of the PIC. One is with the Pol II’s stalk 
domain, at the site of interaction with TFIIE. The other contact pro- 
bably involves the interaction of TFIIH directly with the downstream 
DNA. Although the DNA is not visible in this negative stain recon- 
struction, its position can be extrapolated from the cryo-EM structure 
of the PIC containing TFIIE (Fig. 5a). 

The CAK subcomplex (CDK7-cyclin H-MAT1) of TFIIH, which 
phosphorylates the C-terminal domain (CTD) of RPB1, is missing 
from our PIC reconstruction based on comparison with a recent EM 
study of yeast TFIIH*’. When we analysed images of free human 
TFIH, an additional density that could accommodate the mass of 
the CAK subcomplex appeared highly mobile, in agreement with the 
yeast TFIIH data®’ (Supplementary Figs 15 and 16). Interestingly, 
when this new density, which fits the crystal structures of CDK7 
and cyclin H, is placed in the context of the full PIC, it faces towards 
the CTD of RPB1 (Supplementary Fig. 16). 
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Figure 5 | Positioning of TFITH helicases and model of PIC assembly and 
promoter opening. a, Negative-stain structure of the full human PIC in the 
closed conformation (as in Fig. le). EM density corresponding to the TFI[H 
core complex, lacking any visible CAK subcomplex, is coloured in pink. A 
homology model for XPB° (navy blue) and the crystal structure of XPD (PDB 
3CRV, dark green) are shown docked into the core TFIIH density. The docking 
suggests different roles for the XPB and XPD helicases in promoter opening. 
DNA phosphates crosslinked to XPB are indicated by pink or cyan spheres*’. 
b, Schematic of PIC assembly and promoter melting. Pol II is recruited through 
interaction with TFIIB to the promoter, which is engaged by TBP-TFIIA- 
TFIIB (1). TFUF stabilizes the TBP-TFIA-IIB-Pol II protrusion interaction 
hub and also positions the downstream DNA onto the cleft, forcing the clamp 
to swing into a slightly open state (2). TFIIE binding further stabilizes the PIC 
by interacting with the Pol II stalk, the clamp, and with TFIIF on the other side 
of the Pol II cleft (3). The TFIIE-containing PIC serves as the platform for 
TFIH binding and correctly positions XPB downstream of the INR element 
(4). During strand separation, the clamp domain starts to swing down. The arm 
domain of TFIIF comes close to the Pol II rudder and the TFIIB B-linker. 
Stabilization of these interactions forms a physical barrier for DNA re- 
annealing (5). During promoter opening, the translocase activity of XPB would 
“screw in’ the DNA towards the Pol II active site, leading to a Pol II open- 
promoter state ready for RNA synthesis (6). 


The reconstruction of the TFII[H-containing PIC allowed us to 
dock the crystal structure of XPD* and a homology model of XPB° 
(Fig. 5a and Supplementary Fig. 6e). XPD is positioned in close pro- 
ximity to TFHE and the Pol II stalk, but away from DNA, consistent 
with a scaffolding role in transcription initiation’. On the other hand, 
XPB docked directly on the downstream DNA path, between the +10 
and +20-bp position relative to the TSS (Fig. 5a). This position is 
consistent with previous crosslinking data using purified TFIIH”, but 
inconsistent with a recent crosslinking study using overexpressed 
XPB in extracts, in which XPB was proposed to be positioned closer 
to the TFIIE WH domains and the INR element®. This result might 
reflect an alternative position of this protein during the assembly of 
the PIC, a distinct position of XPB on the DNA when out of the TFHH 
complex, or the effect of other factors like TFIID and mediator on PIC 
organization. The position of XPB that we observed within the TFIIH 
density, together with the movement of the downstream DNA inferred 
from comparison of our reconstructions of the closed and open states 
of Pol II, suggests how XPB could act as a DNA translocase. A translocase 
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model for XPB has previously been proposed’®, but our structure now 
shows XPB positioned further downstream, leaving enough space 
around the INR element for it to be melted during this process. 

We believe that the position of XPB suggests a DNA insertion 
process in which, as XPB walks on the DNA away from the rest of 
PIC, the DNA would be translocated in the opposite direction and 
pushed into the Pol II cleft while maintaining a point of contact with 
RPBS5 (which starts involving the DNA just downstream of the INR in 
the closed complex). This happens concomitantly with a rotation of 
the DNA, with the RPB5 contact probably serving as a pivot point. As 
XPB walks on the DNA helix, it would generate supertwist that would 
be relaxed by unwinding. Although this unwinding cannot happen in 
the DNA that is tightly wrapped and stabilized by the TBP-TFIIA- 
TFIIB-RAP30-protrusion module, it would be facilitated and/or 
stabilized in the DNA region between the BREd and the INR, where 
the arm domain of RAP74 makes contact with the Pol II rudder and 
regions of TFIIB. 


Concluding remarks 


The combination of structures described here provides unprece- 
dented mechanistic insight into the stepwise assembly of the human 
PIC, defining key protein-protein and protein-DNA interactions 
important for PIC function (Fig. 5b and Supplementary Video 1). 
Our structures reveal the location and role of RAP30 WH domain 
within an essential upstream nucleoprotein subcomplex. Its critical 
function in structurally stabilizing the whole PIC is highlighted by our 
direct visualization of the DNA as it is positioned along the Pol II cleft 
upon TFIIF binding. We also show a direct interaction between the 
arm module of the TFIIF dimerization domain and the rudder domain 
of Pol II upon formation of the OC, leading to a direct mechanistic 
model of how this TFIIF element facilitates and/or maintains strand 
separation concomitant with the closing down of the clamp domain of 
Pol II. Our structures show how two essential factors, TFIIB and TFIIF, 
come together at critical locations for their activity in the context of a 
full PIC. Our studies also reveal how TFIIH, because of its large size, 
can simultaneously interact with TFIIE at the base of the Pol II stalk 
and position XPB on downstream DNA. 

Our studies of the closed PIC and an OC mimic illuminate the 
structural transitions necessary during the process of promoter melt- 
ing. The apparent movement of downstream DNA observed when 
comparing the closed PIC and OC structures, together with the posi- 
tioning of XPB on the downstream DNA, suggests how XPB could act 
as a DNA translocase to thread approximately 10 bp of downstream 
double-stranded DNA into the cleft. This translocating activity would 
push against the stably bound upstream DNA around the TATA box 
to induce negative supercoiling near the TSS. We find that the TFIIB 
linker helix and the TFIIF arm domain align with each other at the 
promoter melting start site, probably to facilitate the separation of 
the two strands. Once promoter DNA melting is further extended 
and the Pol II clamp closes down, the TFIIB linker helix and the 
TFIIF arm domain work together with the Pol II rudder to maintain 
the upstream edge of the DNA bubble. 

Finally, the arrangement of components within our PIC structure is 
compatible with existing structural models that include the large, 
multi-subunit mediator and TFIID complexes’’. Future structural 
studies with mediator and/or TFIID will yield further insight regard- 
ing the regulation of PIC assembly and function. In summary, this 
work provides the structural framework needed to integrate bio- 
chemical and structural data into a unified mechanistic understand- 
ing of transcription initiation. 


METHODS SUMMARY 

Protein expression and purification. TBP, TFIIA, TFIIB, TFIIE and TFIIF were 
recombinantly expressed and purified from Escherichia coli. Pol II and TFIIH 
were immunopurified from HeLa cell nuclear extracts”. The design of the DNA 
construct was based on the SCP”, with a Sall restriction enzyme site introduced 
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downstream of the INR element. PIC complexes were assembled according to an 


in vitro transcription protoco 


1’? with minor modifications (see Methods). The 


reactions were incubated with magnetic streptavidin T1 beads (Invitrogen) and 
the desired complexes were eluted by Sall digestion. 

EM and image analysis. Data collection and image processing were conducted 
using the Leginon data collection software“ and the Appion electron microscopy 
processing environment”, respectively. Three-dimensional maps were calculated 
using libraries from the EMAN2 and SPARX software packages***”. Volume 
segmentation, automatic rigid-body docking, figure and movie generation were 
performed using UCSF Chimera®. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

PIC assembly and purification. TBP, TFIIA, TFIIB, TFIIE and TFIIF were 
recombinantly expressed and purified from Escherichia coli. Pol Il, TFIID and 
TFIIH were immunopurified from HeLa cell nuclear extracts following prev- 
iously established protocols**’. The design of the DNA construct was based on 
the SCP”, except that a BREu element was introduced upstream of the TATA 
box" and a Sall restriction enzyme site was included downstream of the INR 
element for purification purposes (templatel, 5’-ACTGGGAAGTCGACCGG 
TCCGTAGGCACGTCTGCTCGGCTCGAGTGTTCGATCGCGACTGAGGAC 
GAACGCGCCCCCACCCCCTTTTATAGGCGCCCTTC-3’; nontemplatel, 5’- 
GAAGGGCGCCTATAAAAGGGGGTGGGGGCGCGTTCGTCCTCAGTCGC 
GATCGAACACTCGAGCCGAGCAGACGTGCCTACGGACCGGTCGACTT 
CCCAGT-3’). The nucleic acid scaffold that was used to generate the PIC in the 
open conformation was designed by modification of the promoter substrate used 
to form the closed PIC. An RNA-DNA duplex beyond 7 bp has been proposed to 
be the trigger for TFIIB release and promoter escape'***. Thus, we replaced the 
segment of DNA containing the INR element with a 3’ -tailed sequence previously 
used to create an arrested transcription state in yeast Pol II**. We matched the 
arrested position of Pol II on the template exactly to the TSS used in our studies, 
thereby creating a Pol II-nucleic acid complex containing only about 5 nucleotides 
at the active site while still containing upstream core promoter elements available 
for assembling the rest of the PIC (template2, 5’-ACTGGGAAGTCGACCGGT 
CCGTAGGCACGTCTGCTCGGCTCGAGTGAGCTAGCTTACCTGGTGTTG 
CTCTAACCCCCACCCCCTTTTATAGGCGCCCTTC-3’; nontemplate2, 5'-G 
AAGGGCGCCTATAAAAGGGGGTGGGGGTT-3’; nontemplate3, 5’-GAGGT 
AAGCTAGCTCACTCGAGCCGAGCAGACGTGCCTACGGACCGGTCGACT 
TCCCAGT-3’). A biotin tag was engineered at the 5’ end of both template strands 
(Integrated DNA Technologies). The duplexed DNA was generated by annealing 
the template strand with equimolar amounts of single-stranded non-template 
DNA at a final concentration of 50 1M in water. The annealing reaction was 
carried out at 100 °C for 5 min and gradually cooled down to room temperature 
within 1h. 

PICs in the closed conformation were assembled according to an in vitro 
transcription protocol” with minor modifications. The assembly buffer con- 
tained 12 mM HEPES, pH 7.9, 0.12 mM EDTA, 12% glycerol, 8.25 mM MgCh, 
60mM KCl, 1mM DTT, 0.05% NP-40, 2.5ngyl-! dI-dC, 10M ZnCh. The 
following purified proteins and nucleic acids were sequentially added into the 
assembly buffer: Pol II, TFIIB, TBP/TFIIA, DNA (templatel—-nontemplate1), 
TFIIF and TFIE at final concentrations of 185nM, 3.6uM, 370nM, 50nM, 
289 nM and 370 nM, respectively. The assembly reaction was kept at 37 °C for 
an additional 5 min whenever a new factor was added. The reaction was incubated 
at 28 °C for 15 min using a 1:10 dilution of the magnetic streptavidin T1 beads 
(Invitrogen) which had been equilibrated with the assembly buffer. Following 
washing of the beads three times using a washing buffer (10 mM HEPES, 10 mM 
TRIS, pH 7.9, 5% glycerol, 5mM MgCh, 50 mM KCl, 1 mM DTT, 0.05% NP-40, 
5 uM ZnCl,), TFIH at a final concentration of 100nM was incubated with the 
beads in assembly buffer at 37°C for an additional 5 min. Following a single 
additional wash of the beads using washing buffer, the desired complex was eluted 
by incubating the beads at 28 °C for 1h with digestion buffer containing 10 mM 
HEPES, pH7.9, 5% glycerol, 10mM MgCl, 50mM KCl, 1mM DTT, 0.05% 
NP-40, 1 unit wl? BSA-free SalI-HF (New England Biolabs). The various PIC 
intermediates were generated by including just the factors of interest during the 
assembly process described above. For preparing TBP-TFIIA-TFIIB-DNA-Pol II 
-TFIIF-TFIIE complex, extra TFIIE was added afterwards to the purified PIC at 
a final concentration of 100 nM. 

It was not possible for us to reconstitute an open complex using either a 
mismatch DNA bubble, probably due to failure of efficiently and specifically 
positioning Pol II on the bubble, or a nucleic acid scaffold containing an RNA 
primer, probably because an RNA-DNA duplex of over 7 bp in length has been 
proposed to be the trigger for TFIIB release and promoter escape'*”’. PIC in the 
open conformation was assembled similarly, except for the following changes. An 
arrested Pol II on the open promoter nucleic acid scaffold was first prepared by 
incubation at 28 °C for 1 h of Pol II and DNA (template2-nontemplate3) at final 
concentrations of 300nM and 80 nM, respectively, in the arresting buffer con- 
taining 12 mM HEPES, pH 7.9, 0.12 mM EDTA, 12% glycerol, 8.25 mM MgCl, 
60mM KCl, 1mM DTT, 0.05% NP-40, 2.5ng wl? dI-dC, 1:100 dilution of 
RNasin Ribonuclease inhibitor (Promega), and 2mM CTP. The following puri- 
fied proteins and nucleic acid were sequentially added into the arrested Pol II 
reaction above: nontemplate2, TBP/TFIIA, TFIIB, TFIIF and TFIIE at final con- 
centrations of 200 nM, 370 nM, 3.6 1M, 289 nM and 370 nM, respectively. The 
desired open-promoter complex was then purified in the same manner as the 
closed complexes above. 
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Purified PIC complexes were crosslinked after elution by incubation with 

glutaraldehyde at a final concentration of 0.05%, on ice and under very low 
illumination conditions, for 5 min, then immediately used for EM sample pre- 
paration (either negative stain or cryo-plunging). 
Electron microscopy. Negative stain samples of PIC complex and of free TFIIH 
were prepared using 400 mesh copper grid containing a continuous carbon 
supporting layer. The grid was plasma cleaned for 10 s immediately before sample 
deposition using a Solarus plasma cleaner (Gatan) equipped with 75% argon/25% 
oxygen. An aliquot (3 il) of the purified sample (~50 nM) was placed onto the 
grid and allowed to absorb for 5 min at 100% humidity in a homemade humidity 
chamber kept under very low illumination conditions. It was subsequently 
stained by five successive 75 ll drops of 2% (w/v) uranyl formate solution, rocking 
10s on each drop followed by blotting till dryness. Data collection was performed 
using a Tecnai F20 Twin transmission electron microscope operating at 120 keV 
at a nominal magnification of 80,000 (1.37 A per pixel). The data were collected 
using the Leginon data collection software** on a Gatan 4k X 4k camera using 
low-dose procedures (20 e— As? exposures) and a range of defocus values (from 
—0.5 to —1.2 um). Between 300 and 600 images were acquired for each of the 
negative stain data sets. 

Preparation of PIC samples for cryo-EM observation was carried out using 400 

mesh C-flats containing 4}1m holes with 41m spacing (Protochips). A thin 
carbon film was floated onto the grid before it was plasma cleaned for 5s using 
a Solarus plasma cleaner (Gatan) equipped with 75% argon/25% oxygen gas 
immediately before sample deposition. An aliquot (3 il) of the purified sample 
(~100 nM) was placed onto the grid and loaded into a Vitrobot (FEI) at 100% 
humidity and 4°C. The sample was allowed to absorb for 5min (under low 
illumination conditions), then was blotted for 4s and immediate plunged into 
liquid ethane. The frozen grids were stored in liquid nitrogen until loaded into a 
Tecnai F20 Twin transmission electron microscope operating at 120 keV using a 
626 single-tilt cryotransfer system (Gatan). Data were acquired at a nominal 
magnification of X100,000 (1.05A per pixel) using low-dose procedures 
(20e A? exposures) and a range of defocus values (from —1.2 to —2.4 um). 
Between 1,500 and 3,200 images for each of the cryo data sets were collected using 
the MSI-T application of the Leginon data collection software“. 
Image processing. Negative stain data pre-processing was performed using the 
Appion processing environment”. Particles were automatically selected from the 
micrographs using a difference of Gaussians (DoG) particle picker’’. The contract 
transfer function (CTF) of each micrograph was estimated using both the ACE2 
and CTFFind programs during data collection***’, the phases were flipped using 
CTFFind, and particle stacks were extracted using a box size of 256 X 256 pixels 
(except for both the TFIIH containing PIC complex and free TFIIH samples, 
which use 320 X 320 pixel boxes) from images whose ACE2 confidence value was 
greater than 0.8, followed by normalization using the XMIPP program to remove 
pixels which were above or below 4.5a of the mean value”. The particle stack was 
binned by a factor of two and two-dimensional classification was conducted using 
iterative multivariate statistical analysis and multireference alignment analysis 
(MSA-MRA) within the IMAGIC software’’. Class averages containing properly 
assembled complexes were manually selected and re-extracted to create a new 
particle stack for reconstruction. 

Cryo data processing was performed in a similar manner as the negative stain 
data. Particle stack was extracted using 384 < 384 pixel box size from phase- 
flipped images and binned by a factor of two. 

Three-dimensional reconstruction. The cryo-negative staining structure of free 
Pol ID’, after low-pass filtering to 60 A, was used as initial model for reconstruc- 
tion of all the negatively stained PIC samples. For reconstruction of the TFIH 
containing PIC, which has a substantial extra mass with respect to Pol II, the 
negative stain reconstruction of the TBP-TFIIA-TFIIB-DNA-Pol II-TFIIF- 
TFIE sample, after low-pass filtering to 60 A, was instead used as the initial 
reference (Supplementary Fig. 1). For reconstruction of the free TFIIH, the core 
TFIH density segmented from the negative stain TBP-TFIIA-TFIIB-DNA- Pol II 
-TFIIF-TFIIE-TFIIH refined model was used as the initial reference, after 
low-pass filtering to 60 A resolution. Three-dimensional reconstruction was con- 
ducted using an iterative multi-reference projection-matching approach con- 
taining libraries from the EMAN2 and SPARX software packages***” with two 
identical copies of the initial model as references. This step allowed us to further 
eliminate contamination, aggregated, or damaged complexes, which became 
enriched in one of the reconstructions. Refinement began at an angular step of 
25° and progressed down to 4° angular increments. At each step, refinement 
proceeded to the next angular step only once >95% of the particles had a pixel 
error of <1 pixel. The particle numbers contributing to the final negative stain 
reconstructions were 11,880 for TBP-TFIIA-TFIIB-DNA-Pol II, 13,770 for as 
previous plus TFIIF, 15,656 for as previous plus TFIIE, 64,712 for as previous plus 
TFIIH, and 13,023 for free TFIIH. The resolution of the reconstruction was 
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estimated using the 0.143 Fourier shell correlation (FSC) criterion to be about 
15 A for PIC complexes and 20 A for free TFIIH. 

Cryo-EM reconstructions were performed in a similar manner. The negative 
stain reconstruction of TBP-TFIIA-TFIIB~-DNA-Pol II-TFIIF was used as the 
initial reference for all the cryo reconstructions, except for that of TBP-TFIIA-— 
TFIIB-DNA-Pol II, which used its corresponding negative stain model as the 
initial reference (Supplementary Figs 2-5). All initial models were low-pass filtered 
to 60 A resolution. The particle numbers contributing to the final reconstructions 
were 122,480 for TBP-TFHA-TFIIB-DNA-Pol II, 43,785 for as previous plus 
TFIIF, 51,043 for as previous plus TFIIE, and 53,505 for as previous in the open 
conformation. To dampen low-resolution amplitudes of the final maps, the 
Fourier amplitudes were adjusted to match an experimental GroEL SAXS curve 
using the SPIDER software”. The estimated resolution, using the 0.143 FSC cri- 
terion, was between 11 and 13 A for the cryo-EM reconstructions of PIC assembly 
intermediates (Supplementary Figs 2-5). 

Local resolution calculation was performed for all reconstructions using the 
‘blocres’ function in the Bsoft package®*'(Supplementary Figs 2-6). Volume 
segmentation, automatic rigid-body docking, figure and movie generation were 
performed using UCSF Chimera“. The globally bent DNA model was generated 
using the 3D-DART online server and the 3DNA software package. 
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Carbon monoxide in clouds at low metallicity in the 
dwarf irregular galaxy WLM 


Bruce G. Elmegreen', Monica Rubio”, Deidre A. Hunter’, Celia Verdugo’, Elias Brinks* & Andreas Schruba® 


Carbon monoxide (CO) is the primary tracer for interstellar clouds 
where stars form, but it has never been detected in galaxies in which 
the oxygen abundance relative to hydrogen is less than 20 per cent 
of that of the Sun, even though such ‘low-metallicity galaxies often 
form stars. This raises the question of whether stars can form in 
dense gas without molecules, cooling to the required near-zero 
temperatures by atomic transitions and dust radiation rather than 
by molecular line emission’; and it highlights uncertainties about 
star formation in the early Universe, when the metallicity was 
generally low. Here we report the detection of CO in two regions 
of a local dwarf irregular galaxy, WLM, where the metallicity is 13 
per cent of the solar value**. We use new submillimetre observa- 
tions and archival far-infrared observations to estimate the cloud 
masses, which are both slightly greater than 100,000 solar masses. 
The clouds have produced stars at a rate per molecule equal to 10 
per cent of that in the local Orion nebula cloud. The CO fraction of 
the molecular gas is also low, about 3 per cent of the Milky Way 
value. These results suggest that in small galaxies both star-form- 
ing cores and CO molecules become increasingly rare in molecular 
hydrogen clouds as the metallicity decreases. 

Wolf-Lundmark-Melotte (WLM) is an isolated dwarf galaxy at the 
edge of the Local Group’. It has a low star-formation rate because of its 
small size and, like other dwarf irregular (dIrr) galaxies, shows no 
previous evidence* for the molecular gas that always accompanies 
young stars in larger galaxies’. One problem with the detection of 
molecules is that the dominant tracer of such gas is CO, and dIrr 
galaxies have low carbon and oxygen abundances relative to hydrogen. 
No galaxy with an O/H abundance less than 20% has been detected 
using CO as a tracer’. Far more abundant is molecular hydrogen 
(H,), but this does not have an observable state of excitation at the 
low temperatures (~ 10-30 K) required for star formation. 

To search for star-forming gas, we surveyed WLM for CO(J = 3-2) 
emission in rotational state J and for continuum dust emission at 
345 GHz using the Atacama Pathfinder Experiment (APEX) telescope 
at Llano de Chajnantor, Chile, with the Swedish Heterodyne Facility 
Instrument’’ and the Large APEX Bolometer Camera’ (LABOCA). 
We also used a map of dust emission at 160 [um from the Spitzer Local 
Volume Legacy Survey’” and a map of atomic hydrogen re-reduced 
from the archives of the Jansky Very Large Array radio telescope. The 


Table 1 | Observations of WLM 


dust measurements can be converted to a dust temperature and a dust 
mass, and, after applying a suitable gas-to-dust ratio, to a gas mass 
from which the H1 mass can be subtracted to give the H2 mass for 
comparison with CO. 

Figure 1 shows WLM and the two regions, designated A and B, 
where we detected CO(3-2) emission, along with H1, far-infrared 
(FIR) and submillimetre images. Observed and derived parameters 
are listed in Tables 1 and 2, respectively. The peak CO brightness 
temperature in each detected region is ~0.01-0.015 K and the line- 
width is ~12kms° ’ (full-width at half-maximum). Previous efforts to 
detect CO(J = 1-0) in WLM*® partly overlapped region A with a 45’’ 
aperture and determined a 5¢ upper limit to the CO(1-0) intensity of 
0.18Kkms_ ’. Our observation with an 18’’ aperture yields an intens- 
ity of 0.200 + 0.046Kkms ' for CO(3-2) in the same region. The 
difference arises because the CO cloud is unresolved even by our 
18'’ beam—we did not detect comparable CO(3-2) intensities in 
our searches adjacent to region A. The previous upper limit corre- 
sponds to a maximum CO(1-0) luminosity of 8,300Kkms_! pc” 
inside 45'’ (which corresponds to a beam diameter of 215 pc at 
WLM), whereas the cloud we detect has a CO(3-2) luminosity ~6 
times smaller (1,500 K km st pe’). Likewise, the previous null detec- 
tion’ in CO(UJ = 2-1) claimed a 5¢ upper limit that is about the same as 
our CO(3-2) detection, but their closest pointing differed from 
region A by ~70 pc (14’’, or half the beam diameter for CO(2-1)), 
which could have been enough to take it off the CO cloud. 

The 160-t1m, 870-j1m and H1 peaks are slightly offset from the CO 
positions, indicating variations in temperature and molecular frac- 
tion. A large H1 and FIR cloud that surrounds region A, designated 
region A1, was used to measure the dust temperature, Ty ~ 15 K, which 
was assumed to be the same throughout the region (the 160-j1m obser- 
vation does not resolve region A, and so a more localized temperature 
measurement is not possible). We determined Ty from the 870-um and 
160-{1m fluxes corrected for the CO(3-2) line and broadband free-free 
emission (Table 1), assuming a modified black-body function with dust 
emissivity proportional to frequency to the power f. Local measure- 
ments’ suggest that / = 1.78 + 0.08, although a range is possible’*"*, 
depending on grain temperature and properties’®. The 870-1m flux was 
also corrected for an unexplained FIR and submillimetre excess that is 
commonly observed in other low-metallicity galaxies'”'*. An alternate 


Source Region Right ascension Declination Beam diameter ('’) Flux 

CO(3-2) A Oh 1 min 57.32s —15° 26’ 49.5” 18 0.200 + 0.046 Kkms + 

Hi A Oh 1 min 57.32s —15° 26’ 49.5” 22 774 +40 mJy kms? 

870 um A Oh 1 min 57.32s —15* 26' 49.5" 22 2.66 + 0.53 mJy (0.11, 0.02)* 
Hi Al Oh 1 min 56.93 s —15° 26’ 40.84” 45 4,170 + 82 mJy km ae 

870 um Al Oh 1 min 56.93 s —15° 26’ 40.84” 45 15.2 + 3.0 mJy (0.11, 0.06)* 
160 pm Al Oh 1 min 56.93 s —15° 26’ 40.84" 45 136.2 + 13.6 mJy (0.05)+ 
CO(3-2) B Oh 2min 1.68s =—15°27' 52.5" 18 0.129 + 0.032 Kkms + 


* Quantities in parentheses are the CO(3-2) flux and the free-free emission, both in mJy, that were subtracted from the source flux before calculating the dust flux. 
+ Quantity in parentheses is the free-free emission, in mJy, that was subtracted from the source flux before calculating the dust flux. The average FIR excess factor’® for the Small Magellanic Cloud (SMC)is 1.7,so we 
divide the CO-corrected and free-free-corrected 870-1m fluxes in the table by 1.7 to get the thermal dust flux. 


11BM Research Division, T.J. Watson Research Center, 1101 Kitchawan Road, Yorktown Heights, New York 10598, USA. Departamento de Astronomia, Universidad de Chile, Casilla 36-D, Santiago, Chile. 
3Lowell Observatory, 1400 West Mars Hill Road, Flagstaff, Arizona 86001, USA. “Centre for Astrophysics Research, University of Hertfordshire, Hatfield AL10 9AB, UK. Cahill Center for Astronomy and 
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Table 2 | Derived quantities for WLM 
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Figure 1 | Observations of the 
galaxy WLM. WLM is a small, gas- 
rich galaxy 985 + 33 kpc from the 
Milky Way’*. It contains 

1.6 X 10’Mo of stars”*, compared 
with (6.4 + 0.6) X 10'°Mo in the 
Milky Way”, and it forms new stars 
at a rate” of 0.006Mo yr, which is 
12 times higher per unit stellar mass 
than the Milky Way”. a, Colour 
composite image: red, Hi; green, 

V band; blue, GALEX far-ultraviolet. 
For H1 the aperture was 7.6'’ and the 
resolution was 2.6 kms !, and for 
CO(3-2) the aperture was 18’’ 
(circles) and the resolution was 
0.11kms }, although the CO(3-2) 
spectra shown in the figure were 
smoothed to a resolution of 

2.2kms |. The CO detections are 
labelled; their exposure times were 
218 min (region A) and 248 min 
(region B). Other regions searched 
with exposure times shorter by 
factors of ~2 to ~6 are indicated by 
plus signs; the presence of 
comparable CO mass in some of 
these other regions cannot be ruled 
out. b, Spectra of the two detections: 
velocities are relative to the local 
standard of rest; CO labels main- 
beam brightness temperature, Tn, 
in kelvin; H 1 labels flux in Jy. c, False- 
colour image of the 870-lm 
observations made with LABOCA on 
APEX. d, False-colour Spitzer 160- 
um image obtained from Spitzer 
archives. In c and d, the images show 
the same field of view and the small 
circles (22'’ diameter, the resolution 
of LABOCA) show where CO was 
detected. The large circle is 45’’ in 
diameter and surrounds a large HI 
and FIR cloud (region Al) where the 
dust temperature was measured. 


Source Region T (K) =* (Mo pe~?) Mass (Mo) 

B= 1.8, aco = 124 + 60 

Dust A 14.7 + 0.7% 0.053 + 0.014 (4.6 + 1.2) x 10? 
Gass A (58 + 15)d¢p ((5.1 £ 1.3) X 10°)dep 
Hil A 27.3414 (2.4+0.1) x 10° 

Ho A 31415 (1.8 + 0.8) x 10° 
Hof] B 20+10 (1.2 + 0.6) x 10° 
B=1., ac0 =34+34t 

Dust A 15.9 + 0.8% 0.032 + 0.008 (2.8 + 0.7) x 102 
Gass A (36 + 9)dgp ((3.1 £0.8) X 10°)dep 
Ho A 8.349 (0.5 + 0.5) x 10° 
Hof] B 5.3+6 (0.3 0.3) x 10° 
B=2,0c0 = 271 +97+ 

Dust A 13.6 + 0.6% 0.087 + 0.022 (7.5+1.9) x 10? 
Gass A (95 + 24)5ep ((8.3 2.1) X 10°)dep 
Ho A 67 +24 (3.9+ 1.4) x 10° 
Hof] B 44+16 (2.5 + 0.9) x 10° 


* Mass column density of gas or dust. 


+The units of aco are Mo pe 2K! km~*s. The uncertainty is dominated by the uncertainties in the 160-1m and 870-um fluxes, as indicated by their error limits in Table 1. The error limits are approximately 


symmetric. 


The dust temperature in region A is assumed to be the same as the measured dust temperature in region Al. 
§ dgp = Rap/1,100 is the gas-to-dust ratio, Rep, normalized by the solar value and scaled to the metallicity of WLM. Lowering dgp lowers aco, but this does not seem reasonable: data suggests that the gas-to-dust 
mass ratio is ~5,000 for 12 + log(O/H) = 7.8 (ref. 27), and this implies larger values of dgp (~4.5) and aco. The gas mass and resulting xco value also depend on the assumed correction factor of 1.7 for 
submillimetre excess. With no correction for this excess, %co increases for all f values: at § = 1.8, xcq = 370. Solutions with no submillimetre excess correction and lower f values? can be found in Supplementary 
Information. In addition, «co depends on the assumed value of CO(3-2)/CO(1-0), which was taken to be 0.8 in Table 2; a value of CO(3-2)/CO(1-0) = 1 increases aco to 155 for B = 1.8. 


|| The Hi mass column density is corrected for helium and heavy elements. 


The molecular mass for region B was calculated using the CO integrated intensity and the value of %c¢o determined from region A. 
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combination of lower / with no submillimetre correction” gives similar 
results (Supplementary Information). The dust mass was calculated 
from an emissivity'* of x = 13.9cm? gm‘ at 140 jim, and converted 
to 870 1m with the same power-law index, /. The dust mass for 
region A is then Maust.a = Ss7o.aD?l KB,(Ta) for flux Sg7o,a, distance D 
and black-body spectral function B,. 

Dust mass is converted to gas mass using a factor equal to the gas-to- 
dust mass ratio, Rgp. An approximation’ is to assume the solar value” 
(1/0.007) increased by the inverse of the metallicity of WLM (0.13), 
which would give 1,100. We use this approximation here, but intro- 
duce a scaling factor to the gas mass, dgp = Rgp/1,100; that is, the gas- 
to-dust ratio normalized by the solar value and scaled to the metallicity. 

The results are in Table 2, assuming f = 1.8 as the fiducial value and 
comparing the results with those for 6 = 1.6 and 2 to illustrate the 
dependence of the results on /. Dust and gas mass correlate’* with the 
assumed f} value. The total gas mass column density in a 22’ region 
around region A is ~58 + 15Mo pe © for B = 1.8 (Mo, solar mass). 
Atomic hydrogen contributes ~27.3 + 1.4Mo pe *, and the remain- 
der is ascribed to molecular H, traced by the observed CO. 

The integral under the CO(3-2) line from regionA is Ico = 
0.200 + 0.046 Kkms 1. This intensity has to be converted to CO(1-0) 
before comparing it with H, mass in the conventional way. We take as 
a guide’ the value of CO(3-2)/CO(1-0) ~ 0.80 in another low- 
metallicity galaxy, the SMC” (where O/H = 20% of the solar value). 
The result, 0.25Kkms 1, is combined with the H, mass column 
density to determine the conversion factor, %co, from CO(1-0) to 
Hb). If %co can be calibrated as a function of metallicity, then CO 
observations can be used directly to infer the molecular gas con- 
tent irrespective of the dust spectral energy distribution. Extensive 
compilations’ ’ show %co increasing strongly at lower metallicity, from 
~4Mo pe ~K ‘km’ 's in the Milky Way’, where the metallicity? is 
12 + log(O/H) = 8.69, down to the previous CO detection limit* in the 
SMC, where aco ~ 70Mo pe °K 'km™'s at 12+ log(O/H) = 8.0. 
Our observations of WLM? at a metallicity of 12 + log(O/H) = 7.8 
continue this trend. 

Taking the H, column density from the residual between the dust- 
derived total and the H1 column density, 31 + 15Mo pe’, and divid- 
ing by the inferred CO(1-0) line integral of 0.25K kms ', we obtain 
&co = 124+60Mo pe *K ‘km 's including helium and heavy 
elements, with a range in %&co from 34 to 271 as f varies from 1.6 
to 2. The corresponding factor, Xco, for conversion from Ico to Hz 
column density would be (5.8 + 2.8) X 10*’cm~*K~ ‘km’ 's, ranging 
from 1.5 X 10°! to 1.3 X 10” as f varies between 1.6 and 2. There is a 
large uncertainty because of the unknown dust properties (f, «, dap 
and the submillimetre excess) and molecular excitation (CO(3-2)/ 
CO(1-0)) in dirr galaxies. 

The star-formation rate based on the Ha and far-ultraviolet** fluxes 
within an 18’’ aperture centred on cloud A is (3.9-4.8) X 10° °Mo yr. 
Dividing these rates into the CO-associated molecular mass using 
&co = 124Mg pe *K ‘km 's gives a CO molecular consumption 
time (for converting gas into stars) of 4.6-3.8 Gyr for region A. In 
region B, the star-formation rate from Ho and far-ultraviolet fluxes 
is (1.7-12.6) X 10° *Mo yr * and the CO molecular consumption 
time is 6.7-1.5 Gyr. These times are only slightly larger than the aver- 
age value in spiral galaxies”, which is ~2 Gyr, but they are ten times 
larger than the rate per molecule in local giant molecular clouds”, 
which is a more direct analogy with our observations. 

The detection of CO in WLM suggests that star formation continues 
to occur in dense molecular gas even at lower metallicities than prev- 
iously observed. The similarity between the metallicities of dIrr galaxies 
such as WLM and those of larger galaxies at high redshift”* implies that 
we should be able to study star formation in young galaxies using the 
usual techniques. 
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Evidence for sympathetic vibrational cooling of 
translationally cold molecules 


Wade G. Rellergert', Scott T. Sullivan', Steven J. Schowalter', Svetlana Kotochigova’, Kuang Chen! & Eric R. Hudson! 


Compared with atoms, molecules have a rich internal structure 
that offers many opportunities for technological and scientific 
advancement. The study of this structure could yield critical 
insights into quantum chemistry'*, new methods for manipulating 
quantum information**, and improved tests of discrete symmetry 
violation®’ and fundamental constant variation® ’°. Harnessing this 
potential typically requires the preparation of cold molecules in 
their quantum rovibrational ground state. However, the molecular 
internal structure severely complicates efforts to produce such 
samples. Removal of energy stored in long-lived vibrational levels 
is particularly problematic because optical transitions between 
vibrational levels are not governed by strict selection rules, which 
makes laser cooling difficult. Additionally, traditional collisional, 
or sympathetic, cooling methods are inefficient at quenching mole- 
cular vibrational motion’'. Here we experimentally demonstrate 
that the vibrational motion of trapped BaCl* molecules is quenched 
by collisions with ultracold calcium atoms at a rate comparable to 
the classical scattering, or Langevin, rate. This is over four orders of 
magnitude more efficient than traditional sympathetic cooling 
schemes". The high cooling rate, a consequence of a strong inter- 
action potential (due to the high polarizability of calcium), along 
with the low collision energies involved’, leads to molecular 
samples with a vibrational ground-state occupancy of at least 90 
per cent. Our demonstration uses a novel thermometry technique 
that relies on relative photodissociation yields. Although the 
decrease in vibrational temperature is modest, with straightforward 
improvements it should be possible to produce molecular samples 
with a vibrational ground-state occupancy greater than 99 per cent 
in less than 100 milliseconds. Because sympathetic cooling of mole- 
cular rotational motion is much more efficient than vibrational 
cooling in traditional systems, we expect that the method also 
allows efficient cooling of the rotational motion of the molecules. 
Moreover, the technique should work for many different combina- 
tions of ultracold atoms and molecules. 

Precision control over the quantum states of atoms has allowed 
tremendous advances in both applied and fundamental physics, ranging 
from ultraprecise clocks’’ and gravity gradiometers", to stringent tests 
on discrete symmetry violations'* and the production of exotic phases 
of matter such as degenerate Bose and Fermi gases'®. Extension of these 
studies to the next simplest such physical system, that is, a diatomic 
molecule, promises even further progress. Typically, the starting point 
for these studies is the production of an ultracold sample, which in the 
case of atoms is usually attained by laser cooling. Applying this tech- 
nique to molecules, however, is technically challenging, owing mainly 
to the molecular vibrational degree of freedom, which has no asso- 
ciated angular momentum. Thus, unlike transitions between different 
rotational and electronic levels, transitions between vibrational levels 
of a molecule are not governed by strict selection rules. As a result, in 
general, a molecule simply cannot scatter the necessary number of 
photons needed to enter the ultracold regime before spontaneously 
decaying to long-lived vibrational levels that are out of resonance with 
the cooling lasers. Although it does seem that a subset of molecules 


are potentially amenable to laser cooling, as has been recently demon- 
strated for SrF (ref. 17) and YO (ref. 18), a more general cooling 
technique is highly desirable. One promising, general method is to 
use a sample of laser-cooled ultracold atoms to collisionally, that is, 
sympathetically, cool a co-trapped sample of molecules to ultralow 
temperatures. Particularly attractive for this technique are charged 
molecules, which, in contrast to neutral molecules, can be trapped 
over a broad temperature range for extended periods of time in 
radio-frequency ion traps, allowing ample time for sympathetic cool- 
ing to occur. 

So far, sympathetic cooling of charged molecules has been demon- 
strated with both laser-cooled atomic ions and cryogenic buffer gases’. 
Unfortunately, neither of these methods can produce molecules that 
are simultaneously ultracold in both their internal and external degrees 
of freedom. On the one hand, sympathetic cooling with co-trapped, 
laser-cooled atomic ions quickly cools molecular ions to very low 
translational temperatures, but, owing to the long-range nature of 
the Coulomb interaction, the method yields little internal-state relaxa- 
tion of the molecules at low collision energies’’. To overcome this 
limitation, techniques of optical pumping”*’ and _state-selective 
photoionization of neutral molecules” have been used in tandem with 
atomic ion sympathetic cooling to produce translationally cold mole- 
cular ions in the lowest few rotational states. Despite their impressive 
results, these techniques lack generality because they are restricted to 
certain classes of molecule and require molecule-specific lasers. On the 
other hand, sympathetic cooling with helium buffer gas has been 
demonstrated to yield cooling of both the translational and internal 
states of molecules owing to the short-range nature of the ion-neutral 
collision; however, the method can only be used at temperatures above 
~300 mK, the point at which the vapour pressure of *He becomes 
negligibly small. Additionally, it is relatively inefficient as a result of 
the low polarizability of the helium atoms. 

An alternative to these methods is sympathetic cooling with ultra- 
cold, laser-cooled atoms, which both allows access to submillikelvin 
translational temperatures and is predicted to be very efficient at cool- 
ing the internal degrees of freedom because their polarizabilities are 
more than 100 times greater than that of helium atoms. Although this 
scheme uses proven technologies that have existed for decades, it has 
not previously been implemented, possibly owing to the misconcep- 
tion that molecular ions cannot undergo primarily elastic collisions 
with neutral atoms that are amenable to laser cooling, but instead 
undergo charge-exchange chemical reactions leading to energetic, 
neutral molecules. Previously, we have shown that, on the contrary, 
many combinations of molecular ions and neutral laser-cooled atoms 
(including the combination of species used here) can coexist without 
undergoing a chemical reaction when colliding”. Here we present 
evidence of internal-state cooling of trapped molecular ions due to 
sympathetic cooling collisions with ultracold atoms co-located in a 
magneto-optical trap (MOT). Although the demonstrated degree of 
cooling is modest, the method is found to be extremely efficient, with a 
vibrational quenching rate constant on the order of the classical, or 
Langevin, atom-ion scattering rate constant. Thus, straightforward 
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improvements in the apparatus should allow production of molecular 
samples in the vibrational ground state in less than 100 ms. In what 
follows, we detail a novel technique that we have developed to measure 
relative populations of molecular vibrational levels, briefly describe the 
apparatus and experimental approach used to sympathetically cool 
BaCl* molecules, and present data detailing the measured vibrational 
sympathetic cooling rate. 

To quantify the degree of sympathetic cooling of the molecular 
internal levels, it is necessary to measure the population distribution 
of rotational and vibrational levels. Typically, this measurement is 
accomplished with spectroscopic techniques, such as predissociation 
or multiphoton ionization/dissociation spectroscopy; however, as for 
most molecular ions, the detailed spectroscopic information necessary 
for these techniques is not yet known for BaCl*. To overcome this 
challenge, we develop a new route to demonstrate vibrational spectro- 
scopic thermometry experimentally, which uses broadband molecular 
photodissociation and as such does not require detailed molecular 
structure data. This novel and general method is applicable to any 
molecule that can be directly photodissociated and allows an accurate 
measurement of the molecular vibrational temperature, as long as the 
internal-state population can be probed on a timescale shorter than or 
comparable to the vibrational relaxation time. The method exploits 
the fact that, although the photodissociation cross-section is broad, 
the individual vibrational levels have unique frequency responses for 
photodissociation, which are given by** 
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where E,, is the Hartree energy, e is the charge of the electron, a is the 
Bohr radius, « is the fine-structure constant, h is Planck’s constant, hv 
is the photon energy, ¢ is the energy of the continuum wavefunction 
and v is the vibrational quantum number of the X state. The rovibra- 
tional wavefunctions of the ground state, ly*) and first electronic 
excited state, 'TI (Fig. 1a), are respectively denoted |X) and |A), with 
rotational quantum numbers J and J’, which have space-fixed z-axis 
projections of M and M’. The quantity (A|d-|X) / (eao / En) is dimen- 
sionless and contains both the radial and angular parts of the dipole 
moment, d. Because the continuum wavefunctions associated with 
solutions of the A'II molecular potential are highly oscillatory, the 
expectation value of the dipole moment operator can be accurately 
approximated by considering only the nature of |A) near the classical 
turning point, Rc. Thus, if the repulsive potential is approximated as 
being linearly dependent on R, with slope dV4/dR, near R = Rc, these 
wavefunctions are well described by Airy functions and the vibrational 
photodissociation cross-section is given by 
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where @, are the real-valued vibrational wavefunctions. Because Rc is 
linearly related to the photon energy, the shape of the photodissocia- 
tion cross-section directly reflects the squared probability amplitude of 
the molecular vibrational wavefunction. 

Implementing the photodissociation thermometry technique to be 
discussed requires knowledge of a photodissociation transition from 
which an accurate determination of the ¢, can be obtained. Recently, 
we identified such a photodissociation transition in BaCl* (ref. 24). 
However, to implement the thermometry technique, it was neces- 
sary to measure the shape of the photodissociation cross-section to 
higher precision. To this end, we have developed a novel, compact 
time-of-flight apparatus”, which enabled us to measure the photodis- 
sociation spectrum of BaCl* about 1,000 times more precisely than in 
ref. 24. From this data (Fig. 1c), slight adjustments to the molecular 
potentials determined in ref. 24 have been made, and the contributions 
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Figure 1 | Photodissociation thermometry of BaCl*. a, Potential energy 
curves for the X'D* and A'TI electronic states and a schematic diagram 
indicating the photodissociation thermometry method described in the text. 
The different shapes of the v = 0 and v= 1 wavefunctions give rise to the 
different frequency responses of the two levels to the photodissociation laser. 
Yspr redistribution rate due to black-body radiation; yppz, transition rate due 
to the photodissociation laser given the cross-section opp. b, The permanent 
dipole moment (PDM) of BaCl* calculated as described in Methods. 

c, Improved data (errors, s.e.) obtained for the cross-section of the 
A'T<X'* photodissociation transition in BaCl* using a separate ion trap 
and time-of-flight system. The dash-dot line is the thermally averaged cross- 
section at 300 K. Also shown are the improved calculations of the individual 
vibrational-level contributions of the v = 0 and v = 1 levels. Although our 
model (Methods) incorporates the lowest four vibrational levels, here we show 
only the v = 0 and v = 1 levels, for clarity. 


of the individual vibrational states to the total photodissociation cross- 
section can be identified, the lowest two of which (that is, v= 0 and 
v = 1) are shown as solid lines in Fig. 1c. 

With the individual vibrational-level cross-sections in hand, the 
crux of the thermometry technique can be understood as follows. 
Once equilibration has been established with the vacuum apparatus, 
which operates at 300 K, ~79% of the molecules are in the v = 0 level, 
~15% are in the v = 1 level and the remaining population is essentially 
all in the v = 2 and v = 3 levels. Although the model introduced later 
incorporates all of these levels, for illustrative purposes it is sufficient 
to consider only the v = 0 and v = 1 levels. Because the wavefunction 
of the v= 1 level has a zero crossing (Fig. 1a) the photodissociation 
cross-section contains a node (Fig. 1b), which occurs at 2 = 225nm 
(v/c = 44,400 cm‘). A laser tuned to this wavelength will photodis- 
sociate molecules in the v = 0 level, but will not affect molecules in the 
v=1 level. Furthermore, as can be inferred by thermally averaging 
equation (1), at the temperatures relevant to this work the rotational 
distribution does not change the resulting cross-sections. As a result, 
if collisions with the ultracold Ca atoms quench the vibrational motion 
of the molecules, this quenching will be signalled by an enhanced 
photodissociation rate at this wavelength. Conversely, by tuning the 
photodissociation laser (PDL) closer to one of the maxima of the v = 1 
cross-section, MOT-induced quenching will instead be signalled by sup- 
pression of the observed photodissociation rate as molecules are being 
cooled to the v = 0 level, lowering the total cross-section. Asa result, we 
can infer the change in vibrational temperature from the change in 
photodissociation rate at different photodissociation wavelengths. 

The hybrid atom-ion apparatus, which we call the MOTion trap, 
used to measure these effects is similar to the one used in refs 26, 27 
(a detailed schematic diagram can be found in Supplementary Infor- 
mation). Typical data obtained to demonstrate vibrational cooling 
is plotted in Fig. 2a. To begin, BaCl* molecules created by laser abla- 
tion are trapped in a linear quadrupole ion trap (Fig. 2a, i), which is 
co-located with a MOT that can be loaded with ultracold Ca atoms. 
Notably, without any cooling, the initial internal and translational 
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Figure 2 | Measurement of MOT-induced vibrational quenching of BaCl* 
ions. a, Example traces of the Ba’ ion fluorescence measured during the 
experimental sequence. Initially the trap is loaded with a pure BaCl* sample 
(pink) (i). The PDL is then turned on to create a mixed sample of BaCl* and 
Ba’ ions (green) (ii). Once the Ba’ ions have been laser-cooled, the MOT 
(blue) is optionally turned on and the BaCl” ions are collisionally cooled by the 
ultracold Ca atoms (iii). Following that, the BaCl* ions are probed with the 
PDL to determine the amount of vibrational relaxation due to the MOT (iv). 
The ratio of the numbers of Ba” ions created in each instance of 
photodissociation with the MOT respectively on or off is then calculated. The 


temperatures of the BaCl’ molecules are determined by different 
mechanisms and are established in the first few seconds after the trap 
is loaded. Internally, the BaCl* molecules thermalize to 300 K owing 
to black-body radiation emitted by the room-temperature vacuum 
apparatus, while translationally the temperature is set by the ion 
dynamics in the trap. To attain measurements with the high signal- 
to-noise ratio necessary to demonstrate vibrational cooling, the system 
requires the use of an ion cloud composed of ~10° BaCl* molecules. 
At these numbers, the initial translational temperature of the Bacl* 
molecules is primarily set to ~1,000 K by collisional heating through 
micromotion interruption, which results when collisions between 
ions disrupt the stable ion trajectories in the trap allowing the radio- 
frequency confining potential to do net work on the ions**. Therefore, 
for this experiment, we use additional translational cooling of the 
BaCl* molecules by sympathetic cooling with co-trapped, laser-cooled 
Ba” ions. These Ba” ions are loaded into the trap with 100% efficiency 
by photodissociating a fraction of the BaCl* molecules using a pulsed 
dye laser as discussed in ref. 24 (Fig. 2a, ii), and, once laser-cooled, 
sympathetically cool the BaCl* molecules to millikelvin translational 
temperatures”. 

These atomic ions also provide an important diagnostic, because 
their fluorescence is used both as a means of determining overlap of 
the MOTion trap*® as well as the initial, proportional number of 
BaCl* molecules. 

Once the BaCl* molecules are translationally cold, as indicated by 
the steady-state value of the Ba* ion fluorescence and camera images 
of the ion cloud, the MOT is turned on (Fig. 2a, iii) and short-range 
collisions with the highly polarizable, neutral Ca atoms result in 
internal-state cooling. To measure the effect of the cooling, the PDL 
is then turned on once again (Fig. 2a, iv) and the increase in fluor- 
escence yields the number of Ba* ions produced, thus allowing an 
inference of the ground-state vibrational population of the BaCl* 
molecules. 

Several sources of systematic error can be eliminated (for example 
the variable number of BaCl* molecules produced in the loading 
process and possible inaccuracies in the absolute cross-section value, 
the PDL intensity or both) by taking the ratio of the final number of 
Ba* ions produced to the initial number, R,, = N>/N, (Methods and 
Fig. 2a). We then repeat this measurement sequence without turning 
on the Ca MOT and determine the same ratio, which we denote Rog. 
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top and bottom panels are data for PDL wavenumbers of 41,500 and 
44,675cm |, respectively. a.u., arbitrary units. b, The average of all data points 
(errors, s.e.) obtained, as described in Methods, is plotted for various 
photodissociation wavenumbers. The blue points are cooling data and the grey 
points are a control analysis that should be consistent with unity (Methods). 
The curves are the expected result from a rate equation model of the 
experimental sequence described in Methods. The agreement with the data 
shows that the BaCl* sample has been cooled to a ground-state vibrational 
population of ~90% in the average case. 


It is straightforward to show that in the linear regime in which we 
operate, the ratio Ro,,/Rog is equivalent to the ratio of the total cross- 
sections in each case, that is, Gon/Gofr. This sequence is repeated several 
times at several PDL wavenumbers. The analysed data (Methods) are 
plotted in Fig. 2b. These data are compared with a control analysis 
at each wavelength, which compares measurements for which the 
MOT remained on or off for both points, and the computed ratio 
should thus be consistent with unity (Methods). From this compar- 
ison, it is clear that the deviation of the actual data is the result of MOT- 
induced vibrational relaxation. 

To quantify the degree of cooling, we use a simple rate equation 
model that incorporates both the continual redistribution of the mole- 
cules’ internal states due to black-body radiation throughout the 
experiment and the MOT-induced vibrational relaxation when the 
MOT is present (Methods and Supplementary Information). Because 
the MOT density and overlap with the ions vary for each of the data 
points in Fig. 2b, a best-fit line using the rate equation is unsuitable. 
Rather, using the calculated effective density”® for each data point, along 
with the rate equation model, we determine an average vibrational 
quenching rate constant of k~ 1X 10 ?cm*s’ '. Using this rate con- 
stant with the typical peak MOT atomic density of ~1 X 10°cm™* and 
a typical overlap factor of 0.2 (Methods), we calculate the average 
expected values of Ron/Rog as predicted from the rate equation model 
(Fig. 2b, blue curve). Because the data agree well with this curve, we use 
the rate equations to extract the vibrational ground-state population, 
which in the case of sympathetic cooling by the Ca MOT is found to 
be P,p = 90%, set by the competition of sympathetic cooling and 
black-body radiation heating from the chamber walls. In contrast, if 
the MOT had provided no cooling, the data would have been consis- 
tent with the black line in Fig. 2b, which corresponds to a vibrational 
ground state population of ~79%. 

To our knowledge, rigorous quantum scattering calculations of the 
vibrational quenching rate have never been performed in any similar 
system. However, the large value of the rate constant, which is more 
than 10* times that observed in traditional sympathetic cooling 
schemes, is probably due to the strong, long-range C,/R* interaction 
potential between Ca and BaCl* (Cy = —ac,e”, where oc, is the polari- 
zability of Ca). Classically, particles colliding on this potential with 
energy E and an impact parameter less than the Langevin length, 
by =./C,/E undergo violent collision trajectories that rapidly spiral 
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inward towards R= 0. Quantum mechanically speaking, the system 
forms a three-body collision complex with reduced mass yi at the 
Langevin rate, ky =1\/2C4 / Lt, which facilitates redistribution of the 
original molecule’s vibrational energy between all three particles. Thus, 
when the collision complex predissociates back into its parent cons- 
tituents, this energy is carried away by the sympathetic cooling part- 
ner, leaving the molecule in a lower vibrational state. The fact that 
the observed rate constant is of the same order as the Langevin rate 
(k ~ k,/5) supports this interpretation’* and suggests that sympathetic 
cooling of molecular internal degrees of freedom will be possible with 
any collision partner that gives rise to long-range interactions con- 
ducive to collision complex formation, for example dipole-dipole and 
monopole-dipole collisions. 

Additionally, we have observed no evidence of any chemical reac- 
tions between BaCl* and electronically excited Ca atoms in the MOT, 
even though such reactions are energetically allowed. This supports the 
arguments of refs 27, 29, which state that these collisions are sup- 
pressed because the long-range atom-ion interaction shifts the atom 
out of resonance with the atomic cooling laser at large atom-ion 
separations and thus prevents atoms in their electronically excited 
state from colliding with the molecular ions. This also provides an 
important simplification to the originally proposed method”’, because 
it eliminates the need for an optical dipole trap for the ultracold atoms. 
Furthermore, it implies that the rate at which sympathetic cooling 
collisions occur can be increased without loss of molecules. Several 
straightforward experimental modifications, such as modifying the ion 
trap geometry such that the ion cloud is fully contained inside the Ca 
MOT and replenishing the Ca atom source, are expected to increase 
the effective Ca MOT density and, as a result, the sympathetic cooling 
collision rate by a factor of ~250. Such improvements would produce 
a vibrational ground-state population greater than 99% in less than 
100 ms, and would potentially render the currently used Ba* trans- 
lational sympathetic cooling unnecessary. 

Finally, owing to incomplete spectroscopic information on the 
BaCl* molecule, direct measurement of rotational energy quenching 
is currently not possible. However, because all evidence from sympath- 
etic cooling systems used so far shows that collisional quenching of 
the rotational energy proceeds at a much higher rate than vibrational 
quenching, it is likely that molecules have been cooled rotationally as 
well; experiments are currently underway to prove this. Ultimately, the 
minimum rovibrational temperature attainable with this method will 
be set by the ion radial micromotion, which determines the lowest 
collision energy with the Ca atoms. This collision energy can be esti- 
mated from the amplitude of the micromotion, which is given as a 
function of electric field as €mm = V2eE(r) /ma@Q, where w and Q are 
respectively the secular and drive frequencies’. The average micro- 
motion kinetic energy (mv>,,,/2=m(4mm@)°/2, where m is the ion 
mass and Vy is the component of the ion velocity due to micromo- 
tion) over a typical cloud profile in our apparatus for typical trap 
parameters (that is, a radial pseudo-potential of depth ~10 eV), is 
~1K, whereas the maximum possible micromotion contribution to 
a collision (for ions residing at the edge of the cloud) is ~10 K. Because 
the BaCl* vibrational splitting corresponds to ~500 K, this energy is 
inconsequential for vibrational cooling. However, the BaCl* rotational 
splitting is ~0.5 K, meaning that in the present experiment the lowest 
few rotational levels could still be populated. Therefore, future experi- 
ments aiming to produce samples of molecules purely in the rovibra- 
tional ground state should use smaller samples of molecular ions, so 
that the cloud size and, thus, the micromotion are reduced to the point 
where collision energies are below the molecular rotational splitting 
energy. In the present experiment, we estimate that this regime can be 
reached by working with a sample of order one-tenth the size of the 
current cloud. 

We have demonstrated a broadly applicable, general technique 
to cool the vibrational motion of polar molecules. We have also 
developed a general technique, photodissociation thermometry, for 
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probing molecular vibrational states, which we have used to deter- 
mine the vibrational quenching rate constant and the vibrational 
ground-state population. This result opens a new route to the produc- 
tion of ultracold molecules and is expected to enable myriad experiments 
in quantum chemistry’ ’, precision tests of fundamental physics*’, and 
quantum information and simulation*”. 


METHODS SUMMARY 


The data acquisition system takes an alternating sequence of data points for which 
the MOT is either on or off during the measurement, and acquires over 100 data 
points at each PDL wavelength. For each wavelength, the ratio of successive points 
(Ron/Rorr) is computed to eliminate unidentified time varying systematic errors, 
and the results are averaged. For the control analysis, the ratios between successive 
data points of the same type (for example Ron 1/Ron2 Rosei/Ror2 and so on) are 
instead computed and the results are averaged. See Methods for details. 

The production of Ba* ions in the experiment is modelled using a simple rate 
equation model that includes the lowest four vibrational levels of the BaCl* 
molecules and accounts for population redistribution among the levels throughout 
the experimental sequence. This redistribution of the levels is due to collisions with 
the ultracold Ca atoms as well as spontaneous and stimulated emission and 
absorption of photons by the molecules owing to the black-body radiation field. 
Because knowledge of the dipole moment of the electronic ground state of BaCl* is 
necessary to determine the spontaneous and stimulated emission and absorption 
rates, we calculated the dipole moment using a non-relativistic, multiconfigu- 
rational, second-order perturbation theory (CASPT2) implemented in the 
MOLCAS software suite. The result is shown in Fig. 1b. See Methods for details. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


To eliminate several sources of systematic error when comparing our data with the 
rate equation model, for example the variable number of BaCl* ions produced in 
the loading process and possible inaccuracies in the absolute cross-section value, 
the PDL intensity or both, we use the fluorescence as a measure of the number of 
Ba* ions created in each instance of photodissociation and calculate the ratio of 
the final amount to the initial amount, R,,, = N2/N, (Fig. 2a). We then repeat this 
measurement sequence without turning on the Ca MOT, and record the same 
ratio, which we denote Ro; It is straightforward to show that in the linear regime in 
which we operate the ratio Ron/Rog is equivalent to the ratio of the total cross- 
sections in each case, that is, Oon/@or We repeat this entire sequence >50 times 
and thus obtain a series of data points Roni, Rott1, Ron,2 Rote2,.... To eliminate 
monotonic, time-varying systematic errors, we compute and average the ratios 
Rona/Rote1s Ron2/Rort2; --. aS well as the ratios Ron,2/ Rote, Ron,3/Rotr2, ---- We then 
average the two results and repeat the experiment at various photodissociation 
wavelengths. These data are plotted as blue circles in Fig. 2b. Also plotted (black 
circles) are the results of a control analysis in which we instead perform the same 
calculation for the ratios Ronj/Ron,2> Rona/Ron,> ++» a8 Well as Rorei/Rore2 Rose3/ 
Roe, --. and average the results. Because these controls are comparisons of the 
same data type, they should yield results consistent with unity, and their variability 
about this value can be used as an indication of the variability in the actual data. 
Their close agreement with unity shows that the deviation from unity of the true 
data is due to MOT-induced vibrational relaxation and is not a statistical coincidence. 

The number of Ba* ions created in each instance of photodissociation can be 
obtained using the following rate equations: 


dN; oj(A)I 


= p®ki+1iNi+1 —kii-1Ni) - ~—N; 
apo (kit iiNisa i-1Ni) hy 
+ $0 AwN;— 50 AN; + > Byp(@)(Nj—M;) Q) 
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Here N; denotes the number of BaCl* molecules in the ith vibrational 
level (we have averaged over the rotational states) and Ng, is the number of 
Ba” ions. 

The first term in the expression for dN;/dt (equation (2)) is due to the MOT- 
induced quenching: k; 1; is the vibrational quenching rate constant for transition 
from the (i + 1)th level to the ith level, p is the peak density of the MOT and @ isa 
factor that quantifies the degree of overlap of the atom and ion clouds. We define 
®= J pca(r)p,(r) dr, where fc, is the unit-peak normalized Ca atom density and 
p, is the unit-integral normalized BaCl* density. Thus, 0 < <1. In the liquid 
phase in which we operate, ® quantifies the effective MOT density experienced by 
each ion as they move in the trap. 

The second term in the expression for dN;/dt results from photodissociation (0; 
is the wavelength-dependent cross-section for the ith level, I is the PDL intensity 
and v is the laser frequency). The last three terms are due to spontaneous emission 
and BBR-induced stimulated emission and absorption written in terms of the 
spectral energy density, p(w), and the Einstein A and B coefficients*”. 

Calculating these coefficients requires knowledge of the dipole moment of the 
electronic ground state of BaCl*, Ay =(2/ 3)(wjitG;/eohc*), where ju = (j|d(R)|i), 
|n) represents the rovibrational wavefunction of the nth rovibrational level 
of the ground electronic state, cj is the angular frequency of the transition, é is 
the vacuum permittivity and c is the speed of light. We calculated this dipole 
moment function, which was previously unknown, using a non-relativistic, 
multiconfigurational, second-order perturbation theory (CASPT2) imple- 
mented in the MOLCAS software suite. The result is shown in Fig. 1b. With 
this in hand, the necessary rates are calculated according to ref. 32, and are 
~1Hz. Finally, we incorporate the lowest four vibrational levels into the 
model and ignore terms where |i — j| > 1. This model, along with the calculated 
effective density of the MOT”® for each data point, allows a determination of the 
vibrational quenching rate constant, k, as well as the ground-state vibrational 
population, P,—o. 


32. Lefebvre-Brion, H. & Field, R. W. The Spectra and Dynamics of Diatomic Molecules 
Ch. 5 (Academic, 2004). 
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Deglacial pulses of deep-ocean silicate into the 
subtropical North Atlantic Ocean 


A.N. Meckler', D. M. Sigman”, K. A. Gibson?, R. Francois‘, A. Martinez-Garcia', S. L. Jaccard!, U. Rohl, LEG: Peterson’, 


R. Tiedemann® & G. H. Haug! 


Growing evidence suggests that the low atmospheric CO, concen- 
tration of the ice ages resulted from enhanced storage of CO in 
the ocean interior, largely as a result of changes in the Southern 
Ocean’. Early in the most recent deglaciation, a reduction in North 
Atlantic overturning circulation seems to have driven CO, release 
from the Southern Ocean?°, but the mechanism connecting the 
North Atlantic and the Southern Ocean remains unclear. Biogenic 
opal export in the low-latitude ocean relies on silicate from the 
underlying thermocline, the concentration of which is affected 
by the circulation of the ocean interior. Here we report a record 
of biogenic opal export from a coastal upwelling system off the 
coast of northwest Africa that shows pronounced opal maxima 
during each glacial termination over the past 550,000 years. 
These opal peaks are consistent with a strong deglacial reduction 
in the formation of silicate-poor glacial North Atlantic intermedi- 
ate water? (GNAIW). The loss of GNAIW allowed mixing with 
underlying silicate-rich deep water to increase the silicate supply 
to the surface ocean. An increase in westerly-wind-driven upwel- 
ling in the Southern Ocean in response to the North Atlantic 
change has been proposed to drive the deglacial rise in atmospheric 
CO, (refs 3, 4). However, such a circulation change would have 
accelerated the formation of Antarctic intermediate water and sub- 
Antarctic mode water, which today have as little silicate as North 
Atlantic Deep Water and would have thus maintained low silicate 
concentrations in the Atlantic thermocline. The deglacial opal 
maxima reported here suggest an alternative mechanism for the 
deglacial CO, release**. Just as the reduction in GNAIW led to 
upward silicate transport, it should also have allowed the down- 
ward mixing of warm, low-density surface water to reach into the 
deep ocean. The resulting decrease in the density of the deep Atlantic 
relative to the Southern Ocean surface promoted Antarctic over- 
turning, which released CO, to the atmosphere. 
Palaeo-oceanographic data show that during Heinrich stadial 1 
(HS1), ~17.5 kyr ago, there was an abrupt decrease in GNAIW forma- 
tion’, widely believed to be caused by a meltwater discharge to the 
high-latitude North Atlantic. At the same time, Antarctica and the 
Southern Ocean warmed and the atmospheric CO, concentra- 
tion rose, factors that are central to the view that changes in the 
Northern Hemisphere triggered those in the Southern Hemisphere, 
leading to the release of CO2 from the Southern Ocean. One proposed 
mechanism is that North Atlantic cooling during HS1 drove the 
Southern Hemisphere westerly winds southwards, increasing the 
upwelling of deep water, which released CO, (refs 3, 4). Increased 
wind-driven upwelling during deglaciations would also have increased 
the formation rates of sub-Antarctic mode water (SAMW) and An- 
tarctic intermediate water (AAIW), which are the main subpolar 
sources for low-latitude thermocline waters. Indeed, a number of 
deglacial observations have been attributed to such a change (see, for 
example, ref. 7). However, other studies have found evidence against 


accelerated SAMW or AAIW flow into the north and south Atlantic 
during HS1*. 

At present, silicate concentrations in SAMW and AAIW are rela- 
tively low. This has been explained as the result of a relatively deep 
dissolution profile for sinking biogenic silica, a high silicate-to-nitrate 
uptake ratio by diatoms, or both. These two factors reduce the silicate 
concentration in Antarctic and sub- Antarctic surface water that is then 
subducted in AAIW and SAMW, respectively"’. The paucity of silicate 
in the modern low-latitude thermocline has inspired the use of bio- 
genic opal export as a measure of the input of silicate-rich deep waters 
into the overlying thermocline”. 

We take the analogous approach of using opal export to reconstruct 
past changes in the communication between the thermocline and the 
deeper ocean. A 550-kyr-long sediment record from the subtropical 
North Atlantic, at Ocean Drilling Program (ODP) Site 658, is analysed 
for changes in opal export. ODP Site 658 (20° 44’ 57’’ N, 18° 34’ 51’’ W, 
2,263-m water depth) is situated in the Mauritanian upwelling cell off 
northwest Africa (Fig. 1). The X-ray fluorescence (XRF) scanning- 
derived element log-ratio of silica to aluminium counts (In(Si/Al)) is 
used as a proxy for sediment opal content. For the last glacial termina- 
tion, we also report new **°Th-normalized opal flux data from nearby 
core MD03-2705 (18° 05’ 49’’ N, 21° 09’ 11’’ W, 3,085-m water depth). 

The In(Si/Al) record is dominated by pronounced peaks during each 
glacial termination (Fig. 2b), reflecting maxima in preserved opal flux 
(Fig. 3e). Additional precession-related cyclicity is observed, which is 
probably due to variations in quartz content caused by climate-related 
changes in grain size (Supplementary Fig. 1). The deglacial peaks, 
however, correspond well to maxima in opal content in the same core’ 
(Fig. 2c). The opal peaks coincide with maxima in overall phytoplank- 
ton productivity as inferred from chlorin concentrations", confirming 
that the sedimentary opal maxima mainly reflect increased production 
and not preservation (Supplementary Discussion 1). Chlorin concen- 
tration peaks are also observed at other times during glacial cycles but 
in those cases are not accompanied by increased opal concentrations. 
This fact, together with the opal peaks not being consistently assoc- 
iated with surface water cooling’® (Supplementary Fig. 2), suggests 
that the opal peaks do not reflect stronger local upwelling but rather 
increased silicate concentration in the upwelled water. Focusing on the 
most recent deglaciation (Fig. 3), we observe two maxima in In(Si/Al) 
and *°°Th-normalized opal flux during HS1 and the Younger Dryas 
stadial. Similar changes in opal production are also seen in other data 
from ODP Site 658, other cores nearby’*”” (Fig. 3) and other sites in the 
North Atlantic’*"’, indicating that the Site 658 opal maxima do not 
record solely local processes. In addition, each opal peak at Site 658 
corresponds to a deglacial maximum in Southern Ocean export pro- 
duction as inferred from Ba/Fe ratios” (Fig. 2d). 

We interpret the deglacial opal peaks as resulting from the mixing of 
silicate-rich abyssal waters into the low-latitude Atlantic thermocline. 
During the Last Glacial Maximum (LGM), Atlantic abyssal water (that 


1Geological Institute, ETH Zurich, 8092 Ziirich, Switzerland. Department of Geosciences, Princeton University, New Jersey 08544, USA. -Rosenstiel School of Marine and Atmospheric Science, University of 
Miami, Florida 33149, USA. “Department of Earth, Ocean, and Atmospheric Sciences, University of British Columbia, Vancouver, British Columbia V6T 124, Canada. °MARUM - Center for Marine 
Environmental Sciences, University of Bremen, 28359 Bremen, Germany. Alfred Wegener Institute, 27568 Bremerhaven, Germany. 
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Figure 1 | Silicate concentration and locations of study sites. Open circle, 
ODP Site 658; filled circles: core MD03-2705 and other sites referred to in the 
text. a, Map of annual average surface silicate concentration”’, indicating the 
scarcity of this nutrient in most of the present-day Atlantic Ocean. 

b, Meridional depth section of silicate concentration in the central Atlantic 


is, water below 2.5 km) apparently had a high silicate concentration, 
similar to that of modern Antarctic bottom water” (AABW) (Fig. 4a). 
The strong reduction in GNAIW during HS1 allowed this silicate-rich 
water to shoal and mix upwards into the thermocline without the 
silicate being swept away by silicate-depleted intermediate-depth flows 
(Fig. 4b). We argue that this is a more plausible scenario than an 
increase in the silicate concentration or formation rate of SAMW or 
AAIW for the following reasons. 

First, modern SAMW and AAIW, although rich in nitrate and 
phosphate, are much lower in silicate than is deep Southern Ocean 
water (~10 and ~25 uM versus 125 |.M, respectively; Fig. 1). During 
glacial terminations, opal accumulation rates at ODP Site658 
increase up to tenfold, from glacial and interglacial background values 
of less than 0.2gcm “kyr ' to more than 2gcm “kyr | during 
termination IV’ (Fig. 2). Even allowing for enhanced opal preservation 
under higher opal fluxes, this dynamic range would have required a 
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Figure 2 | Opal record from ODP Site 658 compared with atmospheric CO, 
concentration and productivity and carbonate preservation in the Southern 
Ocean. a, pCO, record from EPICA Dome C”'; b, In(Si/Al) record from ODP 
Site 658, off Mauritania; c, percentage opal record from Site 658 (ref. 13); d, Ba/Fe 
(green), a proxy for marine export production, and Ca/Fe (blue), a proxy for 
carbonate preservation, from Southern Ocean ODP Site 1094 (ref. 20). The 
deglacial peaks in export production at Site 1094 are interpreted as increased 
nutrient content in upwelled waters, and the carbonate preservation peaks are 
taken to reflect CO, release from the deep ocean. Grey shading indicates glacial 
terminations I to VI, and MIS5 to MIS 15 are interglacial periods. 
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Ocean” (averaged over the area between the two red lines in the inset map), 
illustrating the silicate concentration in major present-day water masses 
(arrows) and the locations of Site 658 and Southern Ocean core TN57-13PC 
(same location as ODP Site 1094). The maps were made using the Ocean Data 
View software”. 


drastic rise in the preformed silicate concentration of SAMW and 
AAIW (Supplementary Discussion 2). The strong wind-driven over- 
turning of the modern Southern Ocean fails to deliver substantial 
concentrations of silicate into SAMW and AAIW today, making such 
a rise unlikely. This view is only strengthened by the early decline in 
dust-borne iron during deglaciations, which may have caused a degla- 
cial increase in the Si/N uptake ratio of Southern Ocean diatoms”, and 
by the early deglacial increase in Antarctic air and Southern Ocean 
temperature”, which should have reduced the capacity of sea ice cover 
to limit silicate consumption by Antarctic diatoms. 

Second, data on changes in the ocean interior favour our explanation 
over increased influence of a high-silicate form of AAIW or SAMW. 
The opal maxima consistently correspond to minima in the °C-to-'7C 
ratio (5'°C) of benthic foraminifera at Site 658 (Supplementary Fig. 2), 
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Figure 3 | Comparison of opal records for the last glacial termination. 

a, °°Th-normalized opal flux from core TN57-13PC (ref. 4); b, Ba/Fe record 
from ODP Site 1094 (ref. 20); c, In(Si/Al) record from ODP Site 658 (Hole A; 
this study); d, ?30Th-normalized opal flux from Site 658 (Hole C; ref. 16); 

e, *°Th-normalized opal flux from core MD03-2705 (this study); f, percentage 
opal in core GeoB7926-2 (ref. 17). The core locations are shown in Fig. la. YD, 
Younger Dryas stadial. 
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Figure 4 | Schematic north-south section of the Atlantic Ocean at different 
stages of the most recent glacial termination. a, The Last Glacial Maximum; 
b, Heinrich Stadial 1; c, the Holocene (current interglacial period). AE, Agulhas 
eddies (inflow of surface water from the Indo-Pacific into the Atlantic); ACC, 
Antarctic circumpolar current; PAZ, polar Antarctic zone; SAZ, sub- Antarctic 
zone. Line colours of interior flows indicate their ventilation source region: 
blue, NADW or GNAIW; yellow, AABW. Line thickness changes among 
panels denote changes in flow rate. Blue shading in the interior reflects the 
concentration of dissolved silicate (Si(OH)4). The black and grey dots indicate 
the locations of ODP Site 658 (this study) and core TN57-13PC (ref. 4), 
respectively, with connected lines representing the source of the upwelled 
water. Purple shading indicates increased polar water column stratification. In 
the LGM state (a), the replacement of NADW by GNAIW was accompanied by 


probably reflecting an increased dominance of nutrient-rich water 
from the deeper Atlantic. During HS1 in both the north and south 
Atlantic, the strongest part of the vertical 5'°C gradient shoaled to 
~1km, with no sign of a focused mid-depth 8'°C anomaly entering 
from the south at the depths of AAIW™, arguing against an increase 
in the rate of AAIW or SAMW inflow to the Atlantic during HS1. 
Consistent with this finding, Cd/Ca data from a South Atlantic site at 
the depth of AAIW show no discernable increase from low glacial Cd/ 
Ca values until after HS1'°. Studies of Cd/Ca and Nd isotopes in the 
Florida straits at the depth of modern SAMW and AAIW (~750 m) 
suggest that the flow of these southern-sourced water masses into the 
North Atlantic was low during the last ice age and did not increase until 
after HS1, during the Bolling—Allerod warm period, with a clear inter- 
ruption during the Younger Dryas*”. In fact, the Nd isotope data (and 
less strongly, the Cd/Ca data) suggest a decline in AAIW penetration 
during HS1 relative to LGM conditions. Because modern SAMW and 
AAIW are incorporated into North Atlantic Deep Water (NADW; 
Fig. 4c), the apparent restart of NADW during the Bolling-Allerad 
warm period has been proposed as the cause for increased SAMW or 
AAIW flowinto the North Atlantic at that time’. We note that a number 
of studies interpret mid-depth changes during HS1 (in Cd/Ca, Si iso- 
topes, Nd isotopes and water-mass radiocarbon content) as resulting 
from increased AAIW formation, but these data are equally well, or 
better, explained by increased upward incursion of the underlying 
southern-sourced abyssal water (Supplementary Discussion 2). 

Third, in the Pacific Ocean, deglacial opal flux maxima have been 
observed during some but not all of the past five glacial terminations”. 
According to our proposed mechanism and as observed during the 
most recent deglaciation, the deglacial reduction in GNAIW should 
have affected intermediate-depth circulation in the Atlantic more 
strongly than in the Pacific. In contrast, if the opal peaks were the 
result of silicate-rich deglacial forms of SAMW and AAIW, they 
should have been as robust in the Pacific as in the Atlantic. 

One observation that our proposed deglacial mechanism does not 
explain is the continued deposition of opal at ODP Site 658 through 
the early interglacial period of Marine Isotope Stage 9 (MIS 9; Fig. 2b). 
NADW formation was probably active during MIS9, which under 
normal circumstances would have isolated the thermocline from the 
high silicate concentration of southern-sourced deep waters. An intri- 
guing possibility for the cause of the high interglacial opal flux of MIS 9 
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shoaling of the upper boundary of silicate-rich southern-sourced deep water. 
During early deglaciation (HS1; b), a reduction in GNAIW formation in the 
North Atlantic allowed the upward mixing of abyssal water rich in silicate (blue 
wavy arrows) to increase the silicate concentration of intermediate and 
thermocline waters (light blue shading), resulting in the observed peaks in opal 
production at Site 658 and in TN57-13PC. The simultaneous downward 
mixing of warmth and, thus, buoyancy (red wavy arrows) reduced the density 
of deep water, stimulating increased formation of AABW and leading to CO, 
release through the PAZ. The interglacial (Holocene) state (c) is characterized 
by strong deep and intermediate water flows from both north and south, which 
transport away the silicate mixed up from southern-sourced abyssal water, 
preventing it from reaching the low-latitude thermocline. 


is that NADW flowed below, rather than above, all southern-sourced 
waters. That is, AABW and AAIW may have been merged as a single 
mid-depth water mass during MIS 9, with NADW below it. MIS9 
had the highest pCO) levels of recent interglacials, which may have 
rendered the high latitudes unusually warm relative to the global mean. 
These conditions would have been especially effective at reducing the 
density of AABW, because its formation is probably more sensitive to 
polar temperatures than is that of NADW, which forms at lower 
latitude and is also more strongly tied to salinity. 

The reconstructed deglacial opal maxima in the subtropical North 
Atlantic clarify multiple aspects of the mechanism of deglaciation. 
Because GNAIW formation and export must have decreased drastic- 
ally to produce these pronounced peaks in opal flux, their systematic 
occurrence indicates that the early reduction in GNAIW formation isa 
universal aspect of deglaciations and is very probably central to the 
sequence of events underlying glacial terminations. Furthermore, the 
evidence against an increase in SAMW or AAIW formation makes our 
observations inconsistent with the ‘wind-shift’ hypothesis for the early 
deglacial CO, release**. The increased Antarctic Ekman upwelling that 
has been proposed to drive the CO; release is physically coupled to the 
equatorward transport of Antarctic surface waters and ‘Ekman pump- 
ing’ farther north*®. An increased northward flow would have gone 
into the formation of additional SAMW or AAIW, or some shallower, 
even more silicate-deplete, subsurface water mass. Any of these con- 
tributions to the upper water column of the low-latitude Atlantic would 
have increased the vertical separation between the low-latitude surface 
and the silicate-rich deep water, counteracting the upward penetration 
of silicate into the thermocline that is required to generate the deglacial 
opal peaks in the low-latitude Atlantic. In addition, stronger Southern 
Hemisphere westerly winds during HS1 would have steepened the iso- 
pycnals in the subpolar Southern Hemisphere, thickening the Atlantic 
thermocline”. This again would have worked to separate the shallow 
tropical Atlantic from silicate-rich deep water, countering the mecha- 
nisms required to generate the observed opal maxima. 

In Antarctic sediments, the higher interglacial concentrations of 
opal and Ba (as in the Ba/Fe record of ODP Site 1094; Fig. 2d) compared 
with glacial values are interpreted as evidence of increased Antarctic 
overturning’. As a corollary, the deglacial opal maxima observed in 
core TN57-13PC (cored at the same location as Site 1094; Fig. 3a) have 
been taken as indicating transient deglacial accelerations in Antarctic 
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overturning that then subsided to modestly increased overturning 
during the full interglacials*. However, the pronounced deglacial opal 
maxima of TN57-13PC very probably share the same cause as those at 
ODP Site 658: a deglacial increase in the silicate concentration of 
Atlantic mid-depth waters. At the site of TN57-13PC, the water at 
~200-m depth, which is a major source of the nutrients imported into 
the euphotic zone, is on a density surface that today sits in the upper 
half of NADW in the low-latitude Atlantic (~1,800 m; ref. 27). The 
reduction in GNAIW formation during HS1 and the Younger Dryas, 
by allowing AABW to shoal, would have increased the silicate concen- 
tration on this density surface throughout the Atlantic. In this light, 
the persistent occurrence of deglacial export production maxima at 
Site 1094 (Fig. 2d) provides a further indication of the systematic loss of 
North Atlantic ventilation at glacial terminations. 

Lacking the westerly-wind-driven mechanism, we are left with the 
question of how the deglacial reduction in GNAIW led to an increase 
in Antarctic overturning and CO, release. The deglacial opal maxima 
provide support for a mechanism that was first raised in the context of 
the Younger Dryas®: just as the loss of GNAIW would have allowed 
the upward transport of silicate from the abyssal Atlantic to the low- 
latitude thermocline, it also would have permitted the downward 
mixing of warm surface water to reach the southern-sourced deep 
waters (Fig. 4b, red wavy arrows). The resulting deep ‘density vacuum’, 
that is, a reduction in the density of deep water that stimulates its 
replacement by new, denser water, would have encouraged deep-water 
formation in the Antarctic, releasing CO, to the atmosphere’ (Fig. 4b). 
Early deglacial warming in the ocean interior has been documented”. 
Although this warming is plausibly interpreted to indicate an Antarctic 
trigger for deglaciation’’, most records suggest that Antarctic air tem- 
perature increased later”*”’. Thus, an alternative view is that the HS1 
shutdown in North Atlantic ventilation led to the ocean interior warm- 
ing, which, in turn, triggered Antarctic overturning, causing Antarctica 
to warm and the atmospheric CO, concentration to rise. 


METHODS SUMMARY 


Elemental ratios at ODP Site 658 were determined by XRF scanning of the split 
core surface using an Avaatech core scanner. Measurements were taken continu- 
ously at a resolution of 1 cm. Cores from all three parallel holes of Site 658 were 
used and a new composite depth was derived from the XRF data. The age model 
for Site658 was constructed on the basis of oxygen isotope stratigraphy and 
additional matching of scanning XRF data to published records from Site 658 
and a nearby core (Supplementary Fig. 3). For core MD03-2705, the published 
age model was used. We derived *°°Th-normalized opal flux data for core MD03- 
2705 from biogenic opal concentrations and excess **’Th activities, determined by 
isotope dilution and measurement on a single-collector inductively-coupled- 
plasma sector-field mass spectrometer. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

XRF scanning (ODP Site658). The scanning XRF measurements at ODP 
Site 658 were performed with an Avaatech XRF Core Scanner at MARUM, Uni- 
versity of Bremen. Data were obtained at a resolution of 1 cm over an area of 
1.2 cm” directly at the split core surface of the archive half, with different settings 
for light elements such as Al and Si (10 kV, 20s, and 150 mA) and heavy elements 
such as Zr (50 kV, 20s and 80 mA). The core surface was covered with 4-um-thick 
SPEXCerti Prep Ultralenel foil to avoid contamination of the XRF measure- 
ment unit. The core scanner includes a Canberra X-PIPS Silicon Drift Detector 
(SDD; model SXD 15C-150-500) with 150-eV X-ray resolution, a Canberra Digital 
Spectrum Analyser DAS 1000 and an Oxford Instruments 100W Neptune X-ray 
tube with rhodium (Rh) target material. Raw data spectra were processed using the 
WIN AXIL (analysis of X-ray spectra by iterative least squares) package from 
Canberra Eurisys. Reported element count ratios were calculated from XRF signal 
intensities. 

To bridge coring gaps and hiatuses, cores from all three parallel holes (A, B, and 
C) were used (Supplementary Fig. 2) and a new composite depth was derived from 
the XRF data. In cores from Hole C, the water content of the sediment is higher 
than in cores from the other two holes because the Hole C cores were frozen after 
drilling to preserve organic molecules better and re-thawed much later in the core 
repository, leaving them less time to dry out. Because water content can affect the 
XRF measurements of light elements such as Al, we used this hole only where 
material from the other two holes was unavailable, and adjusted the Zr/Al and Si/ 
Al ratios from Hole 658C by regression to Zr/Al and Si/Al ratios from Hole 658B 
for a section with overlapping data. 

Grain size correction. Grain size at ODP Site 658 and in nearby cores has been 
shown to vary owing to changes in local climate'*”’. Minerals such as quartz (SiO2) 
and zircon (ZrSiO,) are resistant to weathering and are hence mainly associated 
with coarser grain sizes. Grain size therefore affects the abundance of Si and 
zirconium (Zr) relative to Al, which is mainly associated with clay minerals. At 
Site 658, In(Zr/Al) exhibits very similar variability to In(Si/Al) (Supplementary 
Fig. 1), except for the pronounced deglacial peaks. These variations also corre- 
spond well to the published low-resolution grain size record from the same core” 
(Supplementary Fig. 1). The precession-related variability in In(Si/Al) therefore 
probably reflects the abundance of quartz, not opal. When subtracting the two 
normalized XRF-derived time series, the grain-size-corrected (that is, quartz- 
corrected) In(Si/Al) record agrees well with the published low-resolution opal 
record’* (Supplementary Fig. 1). 

?3°Th normalized opal flux (core MD03-2705). Normalized opal fluxes were 
calculated by multiplying the opal fraction of the sediment by the ratio of expected 
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flux of *°Th scavenged from the water column to observed initial excess *°Th 
activities. Biogenic opal was determined by alkaline extraction of silica** and 
measurement of silica concentrations in the extract by molybdate-blue spectro- 
photometry. Replicate analyses indicate a reproducibility of +3%. °°Th was 
measured by isotope dilution on a Finnigan 144 MAT Element 2 single-collector 
inductively-coupled-plasma sector-field mass spectrometer. The sediment samples 
(~0.3 g) were ground and spiked with ?°°Th before total dissolution in HNO3, HE 
and HCIO,. An aliquot of the resulting solution was spiked with **°U and ””°Th to 
measure **°U and *’Th. The remaining solution was used for **°Th separation 
by ion-exchange chromatography. Initial excess **°Th activities were obtained by 
correcting for the supported detrital *°Th assuming a ***U/*’Th ratio of 0.7, 
ingrowth from authigenic uranium and radioactive decay since deposition. The 
expected excess **°Th activity was based on its production rate from 7**U in the 
water column, given the water depth of the coring site. 

Age models. The age model for the ODP Site 658 record was derived by transfer- 
ring the published benthic 5'*O record*! onto the new composite depth and tuning 
to the same data on the age scale used in the LR04 benthic stack” (Supplementary 
Fig. 3). For the past 120 kyr, the age model was additionally refined by matching 
XRF Ca counts from Hole 658A and Hole 658B to percentage CaCO; data from 
Hole 658C, which has been radiocarbon-dated (ref. 36) (0-15 kyr ago), and by 
matching Zr/Al ratios to the grain-size-derived humidity index from neighbouring 
core GeoB7920-2 (ref. 32) (30-120 kyr ago). All data matching was performed 
using the software AnalySeries’’. For core MD03-2705, the published age model”* 
was used. 
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Electrical image of passive mantle upwelling beneath 
the northern East Pacific Rise 


Kerry Key’, Steven Constable, Lijun Liu? & Anne Pommier? 


Melt generated by mantle upwelling is fundamental to the produc- 
tion of new oceanic crust at mid-ocean ridges, yet the forces con- 
trolling this process are debated’. Passive-flow models predict 
symmetric upwelling due to viscous drag from the diverging tec- 
tonic plates, but have been challenged by geophysical observations 
of asymmetric upwelling®* that suggest anomalous mantle pres- 
sure and temperature gradients’, and by observations of concen- 
trated upwelling centres* consistent with active models where 
buoyancy forces give rise to focused convective flow’. Here we 
use sea-floor magnetotelluric soundings at the fast-spreading 
northern East Pacific Rise to image mantle electrical structure to 
a depth of about 160 kilometres. Our data reveal a symmetric, high- 
conductivity region at depths of 20-90 kilometres that is consistent 
with partial melting of passively upwelling mantle’. The triangu- 
lar region of conductive partial melt matches passive-flow predic- 
tions, suggesting that melt focusing to the ridge occurs in the 
porous melting region rather than along the shallower base of 
the thermal lithosphere. A deeper conductor observed east of the 
ridge at a depth of more than 100 kilometres is explained by asym- 
metric upwelling due to viscous coupling across two nearby trans- 
form faults. Significant electrical anisotropy occurs only in the 
shallowest mantle east of the ridge axis, where high vertical con- 
ductivity at depths of 10-20 kilometres indicates localized porous 
conduits. This suggests that a coincident seismic-velocity anomaly” 
is evidence of shallow magma transport channels’*"* rather than 
deeper off-axis upwelling. We interpret the mantle electrical struc- 
ture as evidence that plate-driven passive upwelling dominates this 
ridge segment, with dynamic forces being negligible. 

We collected magnetotelluric data on the northern East Pacific Rise 
(EPR) to study the electrical structure of mantle upwelling, melt gene- 
ration and magma transport to the crust at a mid-ocean ridge (Fig. 1). 
Magnetotelluric data record naturally occurring low-frequency elec- 
tromagnetic variations that sense electrical conductivity, a physical 
property that primarily depends on temperature, mineralogy and 
the presence of fluids and their interconnectedness’*"'. The upwelling 
region beneath mid-ocean ridges is a particularly suitable target for 
magnetotelluric exploration, because small fractions of conductive 
partial melt can greatly increase bulk conductivity from the generally 
low conductivity of unmelted peridotite*!°"""”. 

The fast spreading EPR separates the Pacific and Cocos plates, which 
diverge symmetrically to produce new sea floor at a half-spreading rate 
of 55km Myr ', while the entire ridge system migrates to the north- 
west at 50km Myr ‘ relative to a hotspot reference frame'® (Fig. 1). 
The 8-11° N EPR ridge segment became a locus of scientific investi- 
gation after the first direct observations of extensive sea-floor volca- 
nism'’. Vigorous hydrothermal venting’*® and a continuous crustal 
magma chamber” suggest this ridge segment taps a robust mantle melt 
supply, a notion reinforced by a sea-floor eruption that buried several 
actively recording seismometers”. Seismic tomography shows the 
shallowest mantle at 9 km depth contains a low P-wave-velocity zone 
that follows the ridge crest along most of this segment, but which 


deviates 5-10 km east of the ridge near our magnetotelluric profile 
(Fig. 1), suggesting that melt delivery to the base of the crust is locally 
focused eastward here’’. Despite a multitude of studies at this ridge 
segment, the deeper mantle upwelling region remained unexplored by 
a large geophysical array until this magnetotelluric study. 

Our survey relied on broadband magnetotelluric instrumentation 
that is sensitive to the high frequencies required to constrain the shal- 
low mantle’’ and that requires only 10-20 days of recording time, 
making array surveys possible with a single research cruise. The 
200-km profile of 29 magnetotelluric stations contains a broad aper- 
ture suitable for imaging mantle upwelling but also has closely spaced 
stations near the ridge axis to constrain melt focusing to the base of the 
crust (Fig. 1). For comparison, the earlier MELT experiment obtained 
sensitivity to deeper upwelling structure using year-long deployments 
of 19 long-period instruments along a more sparely sampled 400-km 
transect®. We used regularized nonlinear data inversion to produce a 
smooth image of two-dimensional electrical resistivity constrained to 
about 160 km depth and to 100 km distance from the ridge axis (Fig. 2). 

The inversion model reveals a prominent conductive region 
(<100Qm resistivity) centred beneath the ridge axis. Conductivity 
(the reciprocal of resistivity) is greatest at depths of about 20-90 km 
and is largely symmetric about the ridge axis. The top of the conductive 
region displays a broad triangular shape that deepens away from the 
ridge, reaching depths of about 80-100 km at 100 km distance. Above 
the sides of this conductor, the mantle becomes significantly more 
resistive (100-2,000 Q m). The deeper mantle, at depths >100 km, is 
less resistive than 60 Q m, but is asymmetric with resistivity as low as 
10Q m to the east. Model perturbation tests show that the central and 
deeper conductive regions are required to obtain a good fit to the data 
(see Supplementary Figs 7 and 8), although the deeper conductor is less 
well constrained owing to its location near the edge of the magneto- 
telluric array and its depth. Anisotropy is insignificant throughout 
most of the model, except beneath the ridge at 10-20 km depth and 
0-10 km position, where conductivity is greatest in the vertical direc- 
tion (Supplementary Fig. 5). 

These conductive regions indicate that the upwelling mantle is 
undergoing significant partial melting. Mineral physics studies find 
that upwelling will trigger decompression melting of dry peridotite 
at a depth of about 65 km, where the ridge adiabat crosses the perido- 
tite solidus’. However, the deeper extent of our conductive region 
agrees with experimental studies of mid-ocean-ridge basalt samples 
that show the upper-mantle source region is slightly damp, with 50- 
200 parts per million (p.p.m.) H2O (refs 21, 22), resulting in a deeper 
peridotite solidus that promotes melting to around 100 km depth’, 
depending on the amount of H,O present. The presence of dissolved 
CO, has also been reported in mid-ocean-ridge contexts, and is 
thought to stabilize carbonated silicate partial melts at even greater 
depths*, and the combination of CO, and H,O may further enhance 
deep silicate melting”. 

Recent laboratory electrical conductivity studies have shown that 
small amounts of dissolved H,O and CO, will increase the conductivity 
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of mantle partial melt’®"’. Because these volatiles are preferentially 
partitioned into melt, the earliest deep melts will be the most con- 
ductive, whereas later shallower melts will have lower volatile fractions 
and hence display a conductivity that is transitional between the 
volatile-influenced and dry endmembers'’. The low resistivity of 
1-10Qm in the main conductive body is consistent with up to a 
few volume per cent partial melt, but the highly conductive core of 
the anomaly (0.3Qm) requires a significantly higher melt fraction 
(>10 vol.%)'°"". However, we warn against over-interpreting this high 
conductivity, because magnetotelluric data are primarily sensitive to 
the integrated conductance of conductive bodies; the inversion may be 
trading-off a narrow size for this core region with a compensating 
higher conductivity. 

Because carbonate melts are much more conductive than silicate 
melt", the 10-60 Q m resistivity beneath 100 km depth can be explained 
by a much lower amount of carbonate partial melt (<0.5 vol.%) that is 
stable at this depth. The moderately resistive gap separating the two 
conductive regions at around 100 km depth is intriguing, because the 
deeper carbonate melts should be migrating upward and enhancing 
conductivity there. However, experimental results have shown that 
when both silicate and carbonate melts are present, the solid matrix is 
preferentially wetted by silicate melt, and therefore carbonate would not 
necessarily be well connected”*; it may also be diluted by the less con- 
ductive silicate melt''. We suggest that this gap represents the depth 
where silicate melt begins to dominate over carbonate melt, leading to 
an overall lower conductivity. 

The melting region at depths shallower than 100 km in our study is 
unusually symmetric compared to geophysical observations at other 
mid-ocean ridges'**, but is juxtaposed with deeper asymmetric melt- 
ing below 100 km. Although viscous drag from symmetrically diver- 
ging tectonic plates is thought to be the main driving force in mantle 
upwelling, asymmetric observations have required additional mechan- 
isms in order to generate melt unevenly across the ridge. A strong 
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Figure 1 | Location of the magnetotelluric survey across the fast spreading 
East Pacific Rise. Twenty-nine sea-floor magnetotelluric stations (white 
circles) were deployed across the ridge axis at 9° 30’ N, about 1,000 km 
southwest of Central America (inset; study area boxed in red). The Pacific and 
Cocos plates diverge symmetrically (black arrows) while the entire ridge system 
migrates to the northwest relative to a fixed hotspot reference frame’® (grey 
arrow). The Clipperton and Siqueiros transform faults (TF) bound this ridge 
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westward asymmetry observed at the southern EPR was conjectured 
to arise from mantle pressure or temperature gradients®’. Localized 
centres of upwelling observed in the Gulf of California do not align 
with the heavily segmented ridge axis, suggesting that melt may be 
dynamically concentrated through buoyant convective flow’. 

We carried out a three-dimensional flow simulation to study 
whether the deep asymmetry in our model could be explained solely 
by passive flow, or requires the addition of dynamic forces or lateral 
temperature and pressure gradients. Absolute plate motions were 
imposed as a boundary condition ona simulation domain that included 
both the Clipperton and Siqueiros transform offsets. Figure 3 shows the 
vertical velocity component of flow induced by the plate motions. In the 
upwelling mantle, melting occurs in proportion to the vertical velocity 
due to decompression but also depends on composition and adiabatic 
temperature. Because the melting rate for hydrous, carbonated perido- 
tite is poorly known, we use vertical velocity as a proxy indicator for 
where deep incipient melts could be generated. 

The deep upwelling shows an asymmetry between the transform 
offsets, with vertical velocities of 0.5-2.5 cm yr _' biased to the east of 
the ridge axis at 100-250 km depth. Although we included westward 
ridge migration in the simulation via the absolute plate motions, it 
does not generate a significant asymmetry in melt production because 
migration only superimposes a horizontal shear to the deep flow®”. 
This is further supported by the lack of asymmetry in the simulation 
far north of the Clipperton transform and far south of the Siqueiros 
transform. Rather, the deep asymmetry arises from viscous coupling 
across the transform offsets, which is intensified because both offsets 
jump to the east and are in relatively close proximity. This upwelling 
asymmetry will generate more melt to the east, offering an explanation 
for our observed asymmetry. However, our model contains deep car- 
bonate melts extending much further off-axis than the peak in upwel- 
ling. Because the magnetotelluric inversion is imaging the amount of 
partial melt present rather than the melting rate implied by the vertical 
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segment to the north and south. Slow seismic P-wave velocity contours 
(velocities <7.6kms ') found by seismic tomography of the uppermost few 
kilometres of mantle’? follow the ridge crest along most of the segment, but 
deviate to the east near the magnetotelluric profile. The colour scale shows sea- 
floor topography and the 100-km scale bar indicates the half-aperture of the 
magnetotelluric array. 
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Figure 2 | Magnetotelluric resistivity image of mantle upwelling beneath 
the East Pacific Rise. Top, surface view of sea-floor topography. Main panel, 
colours show log[resistivity (Q m)] in the vertical direction obtained from 
nonlinear inversion of data from the sea-floor magnetotelluric stations 
(inverted triangles in surface view). Green to red colours indicate high 
conductivity (low resistivity) due to partial melts generated in the upwelling 
mantle. The full anisotropic resistivity results are shown in Supplementary 
Fig. 5. Solidus depths for dry and wet peridotite’ are shown as horizontal lines. 


velocity, it is possible that this conductor represents melt that has 
ascended from greater depths; if deep off-axis melt is partially 
entrained in the mantle flow, it may be transported further to the east 
of the ridge axis owing to the westward ridge migration. 

Upwelling at depths shallower than 100 km is intensive and sym- 
metric beneath the ridge axis, in agreement with the prominent zone of 
silicate melt indicated by the high conductivity in our model (Fig. 3b). 
Despite the smoothing penalty applied during the data inversion, 
our model shows a rapid resistivity increase that coincides with the 
sloping boundaries of the upwelling mantle. Along this boundary, the 
flow pattern turns from upwelling to horizontal flow as the mantle 
begins to be dragged off-axis. Because the high resistivity agrees with 
that of melt-free dry olivine’’, this transition away from the inferred 
melting region is consistent with the closure or compaction of the 
porous melt network generated by upwelling. This implies that buoy- 
ant melt is focused to the ridge axis by the shape of the porous melting 
region, rather than by upward migration to the base of the much 
shallower thermal lithosphere defined by the half-space cooling model 
(Fig. 3b). 

The shallow uppermost mantle above 20km depth is predomi- 
nantly resistive, suggesting it is largely unmolten with a negligible 
porosity. However, moderately low vertical resistivity spanning from 
the ridge axis to 10km to the east is consistent with a vertical melt 
channel connecting from the deeper melting region to the base of the 
crust (Fig. 2). This feature occurs where seismic tomography found a 
low-velocity anomaly directly beneath the crust’”. Our model suggests 
that the off-axis velocity anomaly is due to a shallow melt transport 
channel or network of channels’’ rather than reflecting deeper upwel- 
ling asymmetry. Highly porous transport channels could allow for 
rapid magma ascent to the crust’. This may explain why the high 
conductivity imaged at depths >20km does not extend all the way 
to the base of the crust beneath the ridge axis. Magma generated in the 
upwelling mantle may reach a critical depth near the top of the melting 
region where it is efficiently transported to the crust through porous 
channels that arise episodically in space and time. 
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Figure 3 | Three-dimensional passive-flow simulation of the EPR 
kinematics, including ridge migration, sea-floor spreading and the 
Clipperton and Siqueiros transform offsets. a, Map view showing the 
imposed absolute plate motions of the Pacific and Cocos plates (black arrows) 
and the simulated upwelling vertical velocity at 150 km depth (shaded colours). 
b, Cross-section of vertical velocity along profile A-A’ (in a), which crosses the 
ridge axis at the same position as the magnetotelluric profile. The magenta line 
contours 1cmyr ’ vertical velocity, highlighting the deep asymmetry to the 
east (right). Black lines delineate 0.1, 1 and 10 Q m resistivity contours from the 
magnetotelluric inversion model (Fig. 2). White lines show isotherms for a half- 
space cooling model. 


METHODS SUMMARY 


We used a robust array processing method” to estimate magnetotelluric responses 
from the raw time series and inverted the responses for triaxially anisotropic 
conductivity using nonlinear two-dimensional inversion implemented with an 
adaptive finite element algorithm””*. Flow modelling of mantle upwelling due 
to absolute plate motions was accomplished using a finite element approach. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Magnetotelluric data analysis. The 29 broadband magnetotelluric stations were 
deployed during a single month-long cruise, yielding frequency domain magne- 
totelluric responses in the 20-5,000 s period band. The station names are given in 
Supplementary Fig. 1 and the magnetotelluric data are shown in Supplementary 
Figs 2 and 3. Magnetotelluric impedance skews”' for each station are displayed in 
Supplementary Fig. 4. Low skews from the ridge axis to the west are consistent with 
a predominantly two-dimensional conductivity beneath the ridge. Many stations 
on the Cocos plate have large impedance skews at periods of 100-1,000s, indi- 
cating three-dimensional conductivity effects. These large skews are probably due 
to the 85 km lateral offset of ridge axis, and by inference the conductive upwelling 
region, across the Clipperton transform, which is located only 75 km north of the 
magnetotelluric profile. For our two-dimensional analysis, we omitted all trans- 
verse-electric (TE) data (that is, ridge-parallel electric fields) from sites s48 to s60 
and only included transverse magnetic (TM) data (that is, ridge-perpendicular 
vertical current loops), because TM data are considered more reliable for two- 
dimensional interpretation in the presence of three-dimensional effects*’’. 

Nonlinear anisotropic inversion. We modelled the data using a regularized non- 
linear inversion that solved for anisotropic electrical resistivity with a triaxial form: 


px 0 0 
p=|0 py, 0 (1) 
0 0 Pp, 


where the x direction is aligned with the ridge axis, y is ridge perpendicular and 
z points down. The inversion solved for the three unknown resistivity tensor 
elements on a grid of 7,728 triangular cells spanning from the sea floor into the 
mantle, with small cells conforming to the rugged sea-floor bathymetry and larger 
cells at depth. The total number of unknown parameters was 23,184. 

The inversion follows the usual Occam’s approach by minimizing the following 
functional”®: 


U=||dm|? + n~ ||| W(d—F(m))||? (2) 


The first term measures the model roughness. The second term measures the fit of 
the model m (logo resistivity) to the observed data d, weighted by the inverse data 
standard errors in the diagonal matrix W, where F is the forward modelling 
operator and ||...|| is the two-norm. Occam uses an automatic search method to 
find the largest trade-off parameter ju that fits the data, thereby seeking the 
smoothest model fitting the data. For the triaxially anisotropic resistivity consi- 
dered here, the model roughness term is expanded into the form: 


|| Ora? = |]Omg[°+|| Gm, |||] Om_|°+|| mx — my ||"+|| my —me|| 4] mz — mel? (3) 


The first three terms measure the spatial roughness of each anisotropic resistivity 
component by differencing the values for adjacent resistivity parameters. The 
last three terms measure the degree of anisotropy within each cell so that the 
inversion only generates anisotropy where it is required to fit the observed data. 
Equation (3) has an implicit weighting of unity between the spatial roughness and 
anisotropic roughness measures so that neither is given preferential treatment by 
the inversion. 
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We applied a 10% error floor to the data and then ran the inversion until it 
found a minimum root-mean-squared (r.m.s.) misfit of 1.73. We then restarted the 
inversion and ran it toa target r.m.s. misfit that was 10% larger (1.90), to ensure the 
data were not being over-fitted. The inversion was started from a uniform 10 Qm 
model. Supplementary Fig. 5 shows the resulting anisotropic inversion model and 
Supplementary Fig. 3 shows the corresponding model fits to the data. 

We also carried out an inversion of only the TM mode data to examine differ- 
ences in the inverted model structure as a function of data constraints (Sup- 
plementary Fig. 6). This inversion finds a deep asymmetric conductor below 
100 km depth that is similar to the previous inversion of both the low-skew TE 
mode data and all TM data (Supplementary Fig. 5). The TM-only inversion 
similarly finds a shallower ridge-centred melting region, but its conductivity is 
much lower than the previous inversion that included TE data. This is not sur- 
prising because the TM mode is known to be less sensitive to the conductivity of 
two-dimensional conductors than the TE mode, but is more sensitive to the lateral 
edges of conductors than the TE mode*'. 

Model sensitivity studies. We tested the sensitivity of the data to both the shallow 
(A) and deep (B) conductors and the sub-crustal vertical conductor (C) by erasing 
each feature and computing the corresponding change in data fit. Supplementary 
Fig. 7 shows the outline of these conductive regions, where the resistivity was 
increased to 100Qm (A and B) and 200Qm (C). Supplementary Fig. 8 shows 
the increase in the r.m.s. data misfit for each test. Removing the shallow conductor 
A resulted in an r.m.s. increase of 400% for the TE data and 20% for the TM data, 
showing that it is strongly required by the data. Removing the deeper conductor B 
resulted in a change of only 20% in the TM mode data fit, showing that it is less well 
constrained by the data, yet still required to obtain a good overall fit. Removing the 
vertical conductor C resulted in a 3% change in the TM mode fit; while this is a 
small change, individual sites have up to 9% worse data misfits when the conductor 
is removed, suggesting it is a necessary feature of the model. 
Three-dimensional passive-flow modelling. We constructed a three-dimensional 
flow model simulating the kinematics of the EPR transform faults near 8° and 
10° N using a finite element method*’. The model domain spans 12° (latitude), 24° 
(longitude) and 1,000 km (depth), where the long enough east-west dimension of 
the box avoids the edge effects. A high mesh resolution (~7 km) was selected so 
that the flow pattern does not depend on the grid size. Plate motions were imposed 
as surface velocity boundary conditions, with a full spreading rate of 11cm yr ' 
and a northwestward ridge migration rate of 5 cm yr’ ', according to NUVELIA“. 
Sea-floor ages were used to prescribe the upper thermal boundary layer, with zero 
age at the spreading centre, which increases to 30 Myr towards the edges of the box. 
A temperature- and depth-dependent viscosity structure was used, with the mean 
viscosities at depths of lithosphere, upper mantle, transition zone and lower mantle 
corresponding to 100, 0.01, 1, 30 (relative to 10°" Pas), respectively. 
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Tubicolous enteropneusts from the Cambrian period 


Jean-Bernard Caron’, Simon Conway Morris” & Christopher B. Cameron? 


Hemichordates are a marine group that, apart from one mono- 
specific pelagic larval form, are represented by the vermiform enter- 
opneusts and minute colonial tube-dwelling pterobranchs. Together 
with echinoderms, they comprise the clade Ambulacraria’. Despite 
their restricted diversity, hemichordates provide important insights 
into early deuterostome evolution, notably because of their pha- 
ryngeal gill slits’. Hemichordate phylogeny has long remained 
problematic**, not least because the nature of any transitional form 
that might serve to link the anatomically disparate enteropneusts 
and pterobranchs is conjectural. Hence, inter-relationships have 
also remained controversial. For example, pterobranchs have 
sometimes been compared to ancestral echinoderms’. Molecular 
data identify enteropneusts as paraphyletic, and harrimaniids** 
as the sister group of pterobranchs. Recent molecular phylogenies 
suggest that enteropneusts are probably basal within hemichor- 
dates, contrary to previous views’, but otherwise provide little guid- 
ance as to the nature of the primitive hemichordate”*. In addition, 
the hemichordate fossil record is almost entirely restricted to peri- 
dermal skeletons of pterobranchs, notably graptolites'°"'. Owing 
to their low preservational potentials, fossil enteropneusts are 
exceedingly rare’*"’, and throw no light on either hemichordate 
phylogeny or the proposed harrimaniid-pterobranch transition. 
Here we describe an enteropneust, Spartobranchus tenuis (Walcott, 
1911), from the Middle Cambrian-period (Series 3, Stage 5) Burgess 
Shale. It is remarkably similar to the extant harrimaniids, but differs 
from all known enteropneusts in that it is associated with a fibrous 
tube that is sometimes branched. We suggest that this is the precursor 
of the pterobranch periderm, and supports the hypothesis that 
pterobranchs are miniaturized and derived from an enteropneust- 
like worm®®. It also shows that the periderm was acquired before 
size reduction and acquisition of feeding tentacles, and that coloni- 
ality emerged through aggregation of individuals, perhaps similar 
to the Cambrian rhabdopleurid Fasciculitubus''. The presence of 
both enteropneusts and pterobranchs in Middle Cambrian strata, 
suggests that hemichordates originated at the onset of the Cambrian 
explosion. 


Phylum Hemichordata Bateson, 1885 
Class Enteropneusta Gegenbaur, 1870 
Spartobranchus tenuis gen. nov. 


Etymology. Sparte, Greek for cord or rope, and branchos for gills. 
Lectotype. USNM 108494 

Referred material. Paralectotypes USNM 553526 (see Supplemen- 
tary Information, section 1), in addition, 5,277 specimens of the 
Smithsonian Institution National Museum of Natural History (SI- 
NMNH), 3,608 of the Royal Ontario Museum (ROM) and 128 of the 
Geological Survey of Canada (GSC). 

Locality. Walcott Quarry in Yoho National Park, British Columbia, 
Canada. 

Horizon. Walcott Quarry Member, Burgess Shale Formation, Lower 
Chancellor Group. 

Diagnosis. Vermiform, maximum length 10 cm. Body is comprised of 
a short proboscis, collar, pharyngeal area with up to 40 pairs of gill and 


tongue bars, and elongate posterior trunk ending in a bulbous unit. 
Gut straight, anus terminal. Approximately one-quarter of the spe- 
cimens are associated with a tube that has organic walls. This tube 
varies from straight, completely circular to helicoidal, and sometimes 
branching. No more than one individual per tube has been observed. 

S. tenuis was originally classified as Ottoia tenuis, Walcott, 1911 
(ref. 16). Conway Morris’” removed this taxon from Ottoia (type spe- 
cies Ottoia prolifica (Walcott'’)), which is a priapulan’’, and in passing 
drew attention to its similarity to enteropneusts. Specimens of S. tenuis 
are preserved as flat carbonaceous compressions. They often occur in 
large clusters (Supplementary Information, sections 2 and 4), and this 
taxon is one of the most abundant species in the Walcott Quarry 
community’’”°. A possible related form from the Middle Cambrian 
Wheeler Formation, Utah, United States (Spartobranchus sp.; Sup- 
plementary Information, section 3). 

The body is flexible, consisting of a short proboscis, collar and 
narrow elongate trunk terminating in a bulbous structure. The largest 
complete individuals are approximately 10 cm long with the proboscis 
accounting for less than 5% of the total body length. The trunk width 
represents less than 2% of the total length. The proboscis is usually 
rounded (Figs la, b and 2a-c), triangular (Figs 1c, d, f, g, i-l and 
2d-e, h-j) or, more rarely, elongate (Fig. le). Internal features of the 
proboscis include a darker axial zone (Fig. 1a, e, f) and a posterior 
crescent structure (Fig. 1b, d). These are consistent with the position of 
the kidney-stomochord complex and a pre-oral ciliary organ, res- 
pectively. From the proboscis a thin triangular element also extends 
posteriorly and dorsally towards the collar. This structure is most 
evident when the proboscis is disarticulated and rotated (Fig. 1g). It 
connects the proboscis to the rest of the trunk and is presumably 
related to the stomochord. An additional Y-shaped structure connects 
the proboscis to the collar region in the same area and may represent 
the nuchal skeleton (Fig. 1b, c, f, g-i). The mouth is located on the 
anterior ventral face of the collar (Fig. 1c, d, f). The short collar is 
usually poorly preserved, and appears as an inflated area around the 
anterior part of the trunk (Figs Ic, e and 2h, i). The pharyngeal region 
accounts for approximately 10-20% of the total length, and it is wider 
towards the mouth and tapers posteriorly (Figs la, f, k and 2 a-e, h-j). 
The dorsal branchial pharynx comprises thick and thin bars, inter- 
preted as gill and tongue bars, respectively. In total there are up to 40 
pairs, and these are connected symmetrically along the dorsal surface 
but with no evidence of synapticulae joining the bars (Fig. 1a, f, g, k, 1). 
Darker structures on the dorsal and ventrolateral sides of the branchial 
pharynx may represent an epibranchial ridge (Fig. 1f, g, k, 1) and 
parabranchial ridges, respectively; the latter serve to demarcate the 
dorsal branchial from the ventral digestive pharynx (Fig. 1f, i-k). 
These ridges merge posteriorly into an ovoid structure, interpreted 
as the oesophageal organ (Figs 1a, f, k and 2a, d, e, h, j). The trunk is 
smooth and featureless, and its width mostly defined by the width of 
the gut. The terminus is bulbous (Figs la and 2a-f, h, i, and 
Supplementary Information 4d, f) and can be as wide as the proboscis. 
The gut is straight and the anus is terminal (Fig. 2d). 

Approximately one-quarter of specimens are associated with tubes 
that have a fibrous composition (Fig. 2h, j). They vary from straight, 
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Figure 1 | Spartobranchus tenuis (Walcott, 1911) individuals from the 
Burgess Shale. a, Complete specimen, ROM 62123. b-e, Close-up views of the 
anterior section. b, Lateral view, USNM 202861. c, Oblique view, GSC 136961. 
d, Ventral view, ROM 62124. e, Oblique view, USNM 202781. f, Lateral view, 
USNM 202183/202493 (see also Supplementary Information, section 2b, c). 
g, Oblique view, ROM 62125. h, Scanning electron microscopy image of the 
framed area in g, ROM 62125. i, Oblique view, ROM 62126. j, Lateral view, 


completely circular, branching to helicoidal (Fig. 2). No more than one 
individual per tube has been observed, and it is evident from the varied 
(Fig. 2) positions in which the worm was buried that it was not perma- 
nently attached. These range from being centred (Fig. 2b, c), located 
close to one end (Fig. 2a, d, e), to being fully extended (Fig. 2b, c), or 
contorted or twisted (Fig. 2d-f). Variation in both tube shape and 
disposition of the occupant suggests that the association was compara- 
tively free, although the bulbous posterior may have been used as an 
anchor. In times of stress the worms may have vacated their tubes (or 
may conceivably have produced tubes). Specimens in which the pro- 
boscis is in close proximity to the terminal trunk (Fig. 2h, i) may 
represent a quiescent state. The close match between proboscis and 
tube when the worm is at one end of the burrow (Fig. 2d, e, j) suggests 
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USNM 202472. k, Lateral view, ROM 62127.1, Ventral view, USNM 202780. C, 
collar; CO?, pre-oral ciliary organ; EBR, epibranchial ridge; GB, gill bars; KHS?, 
kidney-heart-stomochord complex; M (or M?), mouth; NS?, nuchal skeleton; 
OO, oesophageal organ; P, proboscis; PC?, proboscis coelom; PR, parabranchial 
ridges; PT, posterior trunk; T, trunk; TB, tongue bars. Question marks indicate 
uncertainty. Scale bars represent 2 mm (a, f, j,k, 1), 1 mm (b, ¢, e, g), and 0.5 mm 
(d, h, i). See also Supplementary Information, section 4. 


that the proboscis was responsible for tube construction. Tubes are 
often corrugated (Fig. 2b, c, f), presumably reflecting local thickening 
to impart strength. It is likely that the tube of S. tenuis provided a stable 
shelter, and that the side-branches were used for escape (Supplemen- 
tary Information, section 5). The absence of obvious sediment in the 
gut suggests that S. tenuis may have pumped water using its ciliated 
gill slits. 

The relationship between hemichordates and echinoderms as ambu- 
lacrarians is well supported'**, and hemichordates continue to have a 
critical role in early deuterostome and chordate evolution studies. 
Nevertheless, although gill slits are homologous’, other features (for 
example, the stomochord and post-anal tail) that were once thought to 
be shared with chordates are probably convergent”. These uncertainties 
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Figure 2 | Spartobranchus tenuis (Walcott, 1911) individuals associated 
with tubular structures, from the Burgess Shale. a, Linear tube, USNM 
203032. b, c, Linear and branching tube, USNM 509820. d, e, U-shaped tube, 
USNM 509806/509814. f, Branching tube, ROM 57170. g, Helicoidal tube, 


serve to amplify yet further the difficulties in delineating the nature of the 
urdeuterostome. The notably disparate anatomy of enteropneusts and 
pterobranchs has also led to disagreement as to whether very early deu- 
terostomes were solitary and motile’®, or colonial and semi-sessile’. The 
idea that they were solitary and motile is more consistent with the develop- 
ment of the harrimaniid Saccoglossus*”***, but more recent molecular 
analyses tend to identify both pterobranchs and harrimaniids either in 
a basal clade’ or in a basal polytomy* within hemichordates. In the 
absence of a clear phylogenetic signal, the morphology of the primitive 
hemichordate must remain open to question. In principle, fossils might 
provide direct evidence, but the available record is strongly biased in 
favour of pterobranchs, including graptolites’*", whose collagenous 
periderm has a high preservation potential. They are an important 
component of early to middle Palaeozoic-era marine biotas. In con- 
trast, the soft zooids are very seldom preserved and shed no light on 
hemichordate origins”, and given their soft-bodied anatomy, the fossil 
record of enteropneusts is exceptionally meager’ and restricted to 
occurrences in Konservat-Lagerstatten. Indirect evidence from trace 
fossils is mostly equivocal’®’’, although a Triassic-period record’® is 
consistent with an early invasion of the deep sea by torquaratorid enter- 
opneusts®. 

Here we provide direct evidence for a Cambrian enteropneust, and 
thus extend their known range by approximately 200 million years. 
This discovery has two important implications. In S. tenuis, the 
absence of synapticulae between the gill and tongue bars, and of hep- 
atic sacs on the trunk, point to affinities with present-day harrimaniid 
worms. A generic relationship is more difficult to resolve because 
several key soft tissue characteristics of the internal anatomy are absent 
or not preserved, including details of the collar nerve cord, coelomic 
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ROM 62128. h, i, Circular tube, USNM 275692. j, Partially disassociated tube, 
ROM 62129; see also Supplementary Information, section 5a. A, anus; B, 
branching structure; DIS, disassociated; G, gut; OLB, organic-lined burrow. 
Scale bars represent 5 mm (a-f, h-j) and 2.5 mm (g). 


diverticula, gonads or a juvenile post-anal tail. Two lines of evidence 
suggest that the collagenous coenecium of pterobranchs had origins in 
the organic tubes of S. tenuis. First, the tubes of S. tenuis have a fibrous 
structure that sometimes shows evidence of tearing. We have no evi- 
dence of characteristic growth bands (fuselli), a pterobranch innova- 
tion that probably appeared with a reduction in size and the evolution 
ofa shield-shaped secretory organ. Second, the secretory proboscis and 
cephalic shield are homologous in enteropneust and pterobranchs 
(both protosoma), and thus the transition from inferred involvement 
of the proboscis in tube construction by S. tenuis to the secretory role of 
the cephalic shield is unproblematic. We suggest that in the lineage 
leading to the Enteropneusta, the tube lining was lost, whereas in ptero- 
branchs it provided a substrate for an astonishing radiation, not least in 
Palaeozoic planktonic ecosystems. 

In conclusion, our study supports the presence of tubicolous harri- 
maniid enteropneusts in the Cambrian, and provides direct evidence 
for a link between harrimaniids and pterobranchs. Whether S. tenuis 
represents a stem-group harrimaniid, a stem-group pterobranch or a 
stem-group of pterobranch plus harrimaniid, or even a stem group 
hemichordate, is not certain. Moreover, it is worth noting that any 
morphological differences between these now distinct clades may have 
been minimal in the Cambrian. However, as S. tenuis seems to be 
closer to the harrimaniids than to any other hemichordates, including 
pterobranchs, a stem-group pterobranch seems unlikely. This would 
suggest that tube construction pre-dated the shift from an harrimaniid 
enteropneust ancestor to the tentaculate pterobranch and that tubes 
were secondarily lost in enteropneusts. Details of the shift to a mini- 
aturized form and coloniality await the discovery of further well- 
preserved material, but Cambrian pterobranchs with the simplified 
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fuselli may provide possible candidates’’. If the primitive position of 
harrimaniids and pterobranchs is confirmed, then the ‘dipleurula’ 
larvae that characterize ptychoderids and echinoderms may be a result 
of homoplasy or secondary loss in the harrimaniid clade. In addition, 
some chordate features that characterize ptychoderids, including gill 
bar synaptaculae and a dorsal nerve cord with a hollow canal (in a few 
species), may be homoplasies. In enteropneusts these characters are 
correlated with large size, and given the present findings, were pro- 
bably absent in their common ancestor with the chordates. 


METHODS SUMMARY 


Sediment that coated sections of some specimens was removed mechanically using 
a micro-engraving tool equipped with a carbide bit. Specimens were photographed 
under different illuminations, and cross-polarized light techniques usually 
provided the best contrasts. Backscatter scanning electron microscopy images of 
uncoated specimens were obtained of fine anatomical details. Pictures of both 
parts and counterparts were merged in Adobe PhotoshopCS6 using the ‘Apply 
Image’ function and ‘Darken’ blending mode (Fig. 1b, d, f, g, i, k, 1). 
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Preservation of ovarian follicles reveals early 
evolution of avian reproductive behaviour 


Xiaoting Zheng’, Jingmai O’Connor’, Fritz Huchzermeyer*, Xiaoli Wang’, Yan Wang', Min Wang* & Zhonghe Zhou® 


The two groups of archosaurs, crocodilians and birds, form an 
extant phylogenetic bracket for understanding the reproductive 
behaviour of dinosaurs. This behaviour is inferred from preserved 
nests and eggs, and even gravid individuals’. Data indicate that 
many ‘avian’ traits were already present in Paraves—the clade that 
includes birds and their close relatives—and that the early evolution 
of the modern avian form of reproduction was already well on its 
way”. Like living neornithine birds, non-avian maniraptorans had 
daily oviposition and asymmetrical eggs with complex shell micro- 
structure, and were known to protect their clutches**. However, like 
crocodilians, non-avian maniraptorans had two active oviducts 
(one present in living birds), relatively smaller eggs, and may not 
have turned their eggs in the way that living birds do’*®. Here we 
report on the first discovery of fossilized mature or nearly mature 
ovarian follicles, revealing a previously undocumented stage in 
dinosaur reproduction: reproductively active females near ovu- 
lation. Preserved in a specimen of the long bony-tailed Jeholornis 
and two enantiornithine birds from the Early Cretaceous period 
lacustrine Jehol Biota in northeastern China, these discoveries indi- 
cate that basal birds only had one functional ovary, but retained 
primitive morphologies as a result of their lower metabolic rate 
relative to living birds. They also indicate that basal birds reached 
sexual maturity before skeletal maturity, as in crocodiles and para- 
vian dinosaurs. Differences in follicular morphology between 
Jeholornis and the enantiornithines are interpreted as forming an 
evolutionary gradient from the reproductive condition in paravian 
dinosaurs towards neornithine birds. Furthermore, differences 
between the two enantiornithines indicate that this lineage might 
also have evolved advanced reproductive traits in parallel to the 
neornithine lineage. 

Three specimens of Mesozoic birds from the Tianyu Natural History 
Museum, Shandong, China (STM), have been found to contain mature 
ovarian follicles (a follicle being the primary oocyte and the sac in 
which it is contained’) preserved in their body cavity. The largest 
specimen (STM2-51 slab and counterslab; Fig. 1 and Supplementary 
Fig. 1) is referable to Jeholornis sp., a basal bird with a bony tail longer 
than that of Archaeopteryx (see Supplementary Information). The 
specimen, in dorsoventral view, preserves approximately 20 follicles 
ventral to the vertebral column and cranial to the pelvic girdle, over- 
lapped by thoracic ribs. The follicles are all round and subequal in size, 
ranging from approximately 7.1 to 8.8mm in diameter (Fig. Ic, d, 
Supplementary Fig. 2 and Supplementary Table 1); the total number 
of follicles may have been higher than that observed in all specimens, 
obscured by overlap. In STM2-51, the follicles are distributed in rows 
between the caudal end of the sternum and the pelvis; the number of 
follicles in each row decreases caudally (three to four proximally, one to 
two distally); however, this pattern may be exaggerated by a crack 
that laterally borders the preserved follicles. In the main slab, the 
follicles are preserved permineralized by a dark macrocrystalline mine- 
ral (too thick to be eggshell; Fig. 1d). In the counterslab, the follicles are 


preserved as pale pink impressions, although some small crystals of the 
black mineral are also present (Fig. 1c). 

The two smaller specimens (STM29-8 and STM10-45; Fig. 2 and 
Supplementary Figs 3-7) are referable to Enantiornithes (see Sup- 
plementary Information); this large clade forms the sister group to 
Ornithuromorpha, the clade that includes living birds (Neornithes), 
and represents the first major avian radiation*’. The two specimens 
vary in proportions of the humerus/ulna and femur/tibiotarsus and the 
shape of the coracoid, indicating that they represent different taxa, 
although poor preservation prevents identifying either specimen at 
the species level (see Supplementary Information). Both specimens 
preserve mature ovarian follicles in their body cavity visible in dorso- 
ventral (slab and counterslab) view. 

Enantiornithes indeterminate (indet.) STM29-8 (Fig. 2c and Sup- 
plementary Fig. 5) preserves at least 12 follicles in situ along the left side 
of the vertebral column next to the synsacrum. Two additional follicles 
are recognized displaced proximally near the left wing. All the follicles 
are nearly circular, with slight variation in size, ranging from 5.8 to 
8.8mm in diameter (Supplementary Table 1). The follicles are pre- 
served as black carbonization in the slab (Fig. 2a, c) and as impressions 
in the counterslab (Supplementary Fig. 4). The surfaces of the follicles 
preserve an uneven lattice structure of fibres, which may potentially be 
imprints of the blood vessels from the highly vascularized perifollicular 
membrane’. 

Enantiornithes indet. STM10-45 (Fig. 2b, d) preserves fewer follicles 
(at least five), subequal in size (ranging from 6.7 to 8.8 mm in diameter; 
Supplementary Table 1). The follicles are located on the left of the 
vertebral column distal to the sternum but cranial to the pelvis; a few 
disarticulated rib elements overlap the follicles. Four well-preserved 
follicles are associated together (possibly obscuring more follicles); an 
additional follicle(s) is caudally displaced slightly from the others (esti- 
mated total of five to seven preserved). The surfaces of the follicles are 
marked by small, black circular spots of mineralization; the roundness 
of these structures suggests that preservation of the follicles in STM10- 
45 may have been bacterially mediated. 

The circular structures interpreted as follicles are clearly located in 
the body cavity of these fossil birds; in both the slab and counterslab 
they are preserved overlapping with bone. The structures are not con- 
sidered to be seeds preserved in the stomach on the basis of their 
morphology and position (E. M. Friis, personal communication), as 
well as comparison with other types of seeds observed in Jehol fossils. 
In Jeholornis prima IVPP (Institute of Vertebrate Palaeontology and 
Paleoanthropology, Beijing) V13274, which preserves seeds in its 
stomach’, the seeds are more numerous, larger (about 10mm in 
diameter), differ in shape (tapered at one end, not circular) and the 
surface is ornamented. The seeds are distributed more ventrally 
whereas the follicles in Jeholornis STM2-51 are dorsally located along 
the vertebral column and just cranial to the pelvis, consistent with the 
position of the ovaries in living archosaurs. Seeds are also preserved in 
the crop of some Jehol birds (Sapeornis, Hongshanornis)"’; in these 
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Figure 1 | Jeholornis sp. STM2-51 preserving mature ovarian follicles. 
a, Photograph of the main slab in dorsal view. Scale bar, 2 cm. b, Interpretative 
drawing of the slab. c, Close-up of the follicles indicated by the red square in the 


specimens, the crop is found cranial to the sternum and the seeds differ 
from the follicles in that they are uneven in size and morphology (less 
round). These structures are clearly not gastroliths, which are com- 
monly preserved in a large number of Jehol ornithuromorph birds"; 
these stones are always preserved nearly as in life: three-dimensional, 
numerous, varying in shape, size and mineral composition, with 
polished surfaces. Furthermore, gastroliths are proportionately much 
smaller than the soft-tissue structures preserved here. The circular 
shape of the preserved structures is consistent with two-dimensional 
preservation of a spherical structure, such as the mature follicles of the 
ovary (single enlarged cells). The absence of any preserved eggshell and 
their clumped association, with no caudally located in situ follicles, 
indicates that the follicles had not entered the oviducts. 

Living archosaurs—crocodilians and birds—differ greatly in their 
reproductive habits in terms of clutch size, nesting behaviour, degree of 
parental care and developmental strategy of young, posing the evolu- 
tionary questions of when, how and why derived avian reproductive 
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main slab photograph. Scale bar, 1 cm. d, Close-up of the follicles in the 
counterslab. Scale bar, 1 cm. 


traits evolved within Dinosauria. The evidence indicates that although 
the non-avian theropod reproductive system retained some primitive 
archosaurian traits (that is, two functional oviducts, hyper-ellipsoidal 
eggs, precocial offspring), derived bird-like characteristics were also 
present (one egg per oviduct per day deposited, larger eggs relative 
to body size, complex eggshell microstructures, asymmetrical eggs, 
parental care)'>*'*. However, information on the soft-tissue anatomy 
of reproductive organs and the sequence and timing of changes in 
reproduction strategy within the Aves class itself remains poorly 
understood. These new specimens provide the first glimpse into how 
basal birds reproduced. They also record a fairly wide bracket within 
the Mesozoic bird phylogenetic tree (Fig. 3), thus revealing different 
stages of evolution. 

In living archosaurs, as in all vertebrates, the female reproductive 
system is generally divided into two separate parts: the ovary and 
oviduct'*". Birds are unique among amniotes in that, although the 
embryo has two ovaries and oviducts, only the left ovary and oviduct 
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are functional in the adult. The kiwi differs from other birds in that 
both the left and right ovaries develop (and are functional), although it 
is only the left oviduct that develops (other exceptions occur, particu- 
larly among birds of prey)’*. The ovary contains all of the female 
reproductive oocytes, the number of which diminishes during the life- 
span of the individual, although the ovary itself enlarges as the bird 
approaches reproductive maturity. Each reproductive cycle some of 
the normally small oocytes mature into follicles, becoming enlarged 
through yolk deposition (vitellogenesis) in preparation for ovulation. 
In crocodiles, similar to birds, the follicles do not mature all at once, 
forming a hierarchy that reflects the order in which follicles will ovu- 
late. Crocodilians have a low metabolic rate and therefore vitellogen- 
esis lasts an extended period of time (months compared to 4-16 days 
in birds)'*”*; as a result, the follicular hierarchy is much less distinct in 
crocodilians and mature follicles are subequal in size’’. Living birds have 
a high metabolic rate and yolk deposition is rapid, resulting in strongly 
apparent follicular hierarchy’ (Fig. 3). No medullary bone (a rapidly 
mobilized source of eggshell calcium in living birds) forms before 


oviposition’®”’; instead, crocodilians access calcium from structural 
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Figure 2 | Enantiornithine specimens preserving 
mature ovarian follicles in dorsal view. 

a, Photograph of the main slab of STM29-8. 

b, Photograph of the main slab of STM10-45. 

c, Close-up of the follicles in STM29-8 preserved in 
the region indicated by the red box in a. d, Close-up 
of the follicles in STM10-45 preserved in the region 
indicated by the red box in b. All scale bars, 2 cm. 


bone’®. Female crocodilians will ovulate a full clutch of up to 60 eggs 
(average clutch size ranges from 12 to 48 among living crocodilians)”’, 
and store the eggs in the distal end of the oviduct for several weeks, with 
egg laying occurring en masse when the embryos are approximately at 
the 18-somite stage’®. Although the adult guards the nest and young, 
eggs are passively incubated and young are not typically fed by the 
adult, a highly precocial developmental strategy. Living neornithine 
birds oviposit daily, with the exception of the paleognaths, which lay 
one egg every 2-6 days”. 

Non-avian maniraptoran dinosaurs are known to have two active 
oviducts, from the paired association of eggs in the body cavity of one 
specimen’. Why living birds lost the right ovary and oviduct is unclear, 
but the most common hypothesis suggests that this was related to the 
need to reduce weight in flight during the reproductive season, the 
female having to carry only a single egg inside rather than two, 
although loss may also have been to alleviate the demands for calcium 
during ovulation”. As the follicles are interpreted as being in the 
ovaries of these specimens, we cannot comment directly on the num- 
ber of functioning oviducts; however, both enantiornithines, in dorsal 
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view, preserve the association of follicles clearly on the left side of the 
body. The position of the ovary is not clear in Jeholornis STM2-51; 
however, the thoracic vertebrae dorsal to the follicles are preserved in 
left lateral view and the follicles appear to be isolated to the left side, as 
in the enantiornithines, suggesting that only the left ovary was active. 
As the closest-related dinosaurs are known to have two functioning 
oviducts, the loss of function in the right ovary is believed to have 
occurred very close to the non-avian maniraptoran-avian transition, 
supporting the hypothesis that its loss is flight related. Notably, it is 
documented in some crocodilians that the right ovary is the larger of 
the two, producing a greater number of mature follicles (and thus 
having a greater weight)’. 

The large and even size of the follicles preserved in these specimens 
suggests that these female individuals were ready or near ready to 
ovulate; if the birds were in the early stages of vitellogenesis, follicle 
size would not be so large and consistent. The slightly higher variation 
in size among the follicles preserved in STM29-8 may suggest that 
vitellogenesis was incomplete. Alternatively, this may indicate that 
vitellogenesis occurred more rapidly and over a shorter period of time, 
producing a more distinct follicular hierarchy, similar to living birds. 
This latter hypothesis is consistent with the smaller number of follicles 
relative to Jeholornis and the more derived phylogenetic placement of 
enantiornithines. The ovaries are not preserved in their entirety. We 
infer that the immature oocytes, previtelline follicles and atretic folli- 
cles in the ovary were either not preserved or are not visible (small, 
obscured by overlap), or potentially a combination of both. We pro- 
pose that the presence of a perivitelline layer and other protective 
layers (absent in immature oocytes)’ facilitated preservation of the 
mature follicles (see Supplementary Information). The absence of eggs 
in the oviduct(s) suggests that ovulation had not yet begun; alterna- 
tively, the animal may have died in the time between laying one egg 
and ovulating another (typically less than 1 hour in living birds), or 
an egg(s) in the oviduct(s) may have been expelled from the body 
after death, similar to the female pterosaur reported from the Jurassic 
of China”’. 

In the long-tailed bird Jeholornis STM2-51 the size of the follicles is 
very consistent (Fig. 1c, d), suggesting a more crocodilian style of 
reproduction, in which a large number of follicles reach maturity near 
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Figure 3 | Reconstructions of living archosaurs 
Crocodylia and Neornithes, as well as the three 
fossilized specimens showing relative ovary 
morphology mapped over a simplified 
cladogram of Archosaur relationships. Green 
lines indicate extant lineages, red lines indicate 
extinct lineages and dashed lines indicate lineages 
with no data. 
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simultaneously so that follicular hierarchy is minimal”. This suggests 
that vitellogenesis occurred more slowly, similar to crocodilians, and is 
consistent with the lower metabolic rate relative to extant taxa pre- 
dicted for basal birds**. The preserved size of the follicles suggests that 
egg size was large relative to the body in basal birds, despite morpho- 
logical restrictions such as distally contacting pubes; eggs would have 
been too large to store in the body and so oviposition would have to 
occur between ovulation, as in other maniraptorans and living birds, 
but unlike crocodiles’. 

Ina healthy and undisturbed female, all mature follicles will ovulate 
and thus the number of mature follicles is a good proxy for clutch 
size”'*. Crocodilians have much larger clutches than birds, but the 
follicle (and complete egg) is much smaller compared to the adult 
body size. In Alligator mississippiensis the mature follicle has a width 
of 45 mm; the follicle/femur ratio changes with age, but averages 0.017 
(refs 15, 24). Comparative data on follicle size is limited, but in Gallus 
mature follicles reach 40 mm (ref. 7) (size exaggerated through domes- 
tication), comparable to that of A. mississippiensis despite their vast 
difference in body size (follicle diameter/femur length ratio 0.51). 
Although the absolute size of the preserved follicles, inferred to be 
slightly exaggerated by compression, is similar between the three speci- 
mens, the adult body size differs considerably: Jeholornis STM2-51 
body mass is estimated at 676 g; enantiornithine STM29-8 is estimated 
to be 125 g; enantiornithine STM10-45 is estimated to be 105 g (ref. 25 
was used to estimate body mass; Supplementary Table 2). As a 
result, each specimen has a distinct egg/body size ratio. Jeholornis sp. 
STM2-51 preserves the largest number of follicles, which are propor- 
tionately the smallest compared to overall body size (follicle diameter/ 
femur length ratio 0.087), consistent with the basal position of this 
taxon within Aves (only more derived than Archaeopteryx). The larger 
enantiornithine STM29-8 preserves a larger number of smaller follicles 
(follicle/femur ratio 0.171), whereas the smaller enantiornithine STM10- 
45 preserves only a few follicles nearly the same size as Jeholornis 
STM2-51, despite the disparity in their overall body size (follicle/femur 
ratio 0.217). These specimens show the same trade-off between egg 
size and clutch size observed in living birds”®. The large clutch size in 
Jeholornis STM2-51 and Enantiornithes indet. STM29-8 is consistent 
with the precocial reproductive strategy inferred for basal birds’. The 
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smaller clutch preserved in Enantiornithes indet. STM10-45 (although 
potentially an artefact of preservation) suggests that enantiornithines 
may also have been diverse in their reproductive strategies. 

Another key difference between living archosaurs is the timing of 
sexual maturation. In crocodilians reproductive maturity has an early 
onset, before the animal reaches skeletal maturity. Modern birds grow 
rapidly, typically reaching skeletal maturity within 1 year, although 
not typically becoming sexually mature until later (2-8 years; only 
6 months in the domestic chicken)”"*. Histological analyses of para- 
vian dinosaurs reveal a crocodilian pattern of reaching sexual maturity 
before skeletal maturity”*. Enantiornithine STM10-45 shows a clear 
absence of fusion in compound bones like the carpometacarpus (Sup- 
plementary Fig. 8). In Enantiornithes, compound bones are known to 
fuse late in ontogeny* and STM10-45 is regarded as a subadult, indi- 
cating that, in at least this lineage of enantiornithines, reproductive 
maturity was achieved before skeletal maturity, as in paravian dino- 
saurs and crocodilians. Histological analysis was attempted on the 
three specimens, but unfortunately no bone could be sampled from 
STM10-45. Samples from STM2-51 and STM29-8 were badly crushed 
and the medullary cavity was not clearly preserved, preventing the 
identification of medullary bone (see Supplementary Information). 
However, it was determined that enantiornithine STM29-8, which 
shows a greater degree of fusion in the compound bones (poor pre- 
servation prevents determining the exact degree of fusion), was skele- 
tally mature and required more than 1 year to reach adult size (see 
Supplementary Information). 

The preservation of reproductive organs such as ovarian follicles 
allows a rare opportunity to confidently identify the gender of these 
specimens and recognize sexually dimorphic traits. The elongate 
paired tail feathers present in some enantiornithines strongly resemble 
those of Confuciusornis, which have been interpreted as a sexually 
dimorphic trait present only in males; this interpretation is controver- 
sial as the feathers do not co-vary with size as expected”. Feathers are 
only preserved in STM29-8 (Fig. 2a). The feathers around the pygo- 
style are clearly preserved and confirm the absence of paired elongate 
rectrices in this reproductively mature (adult) female. 

The ovarian follicles preserved in these Early Cretaceous birds, the 
first discovery of this kind, reveal new clues regarding basal bird repro- 
duction. Based on the bracket placed by the fossil record, the consis- 
tent preservation of the follicles on the left side of the body in all three 
specimens suggests that the right functional ovary was lost at the 
dinosaur-avian transition. Clutch size in basal birds is inferred to be 
plesiomorphically large, consistent with the precocial developmental 
strategy inferred from embryos and juveniles, and information from 
closely related dinosaurs. Within Aves there is an observable trend 
towards smaller clutches with relatively larger eggs that extends from 
long bony-tailed birds closely related to Archaeopteryx into Neor- 
nithes, and clade-specific specializations in reproduction strategy were 
not limited to the crown group. Despite the presence of advanced 
features, the ovary in Mesozoic birds was large with minimal follicular 
hierarchy, and sexual maturity preceded skeletal maturation as in 
crocodilians, both conditions consistent with the lower metabolic rate 
inferred for basal birds and paravian dinosaurs compared to neor- 
nithine birds. 


METHODS SUMMARY 


Bone samples were taken from all three specimens; only a single sample could be 
retrieved from STM10-45 and it could not be used for histological analysis. Two 
samples were taken from each of the other specimens where the bone was already 
broken, as close to midshaft as preservation allowed: the ulna and femur were 
sampled in STM2-51 and the humerus and femur were sampled in STM29-8. The 
samples were taken using a micro-saw and were embedded in EXAKT Technovit 
7200 one-component resin and allowed to dry for 24h. The samples were then cut 
and polished until the desired optical contrast was reached. The samples were viewed 
under normal and polarized light using a Leica DMRX polarizing microscope. 
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Accelerated gene evolution through replication- 


transcription conflicts 


Sandip Paul’, Samuel Million-Weaver', Sujay Chattopadhyay’, Evgeni Sokurenko! & Houra Merrikh! 


Several mechanisms that increase the rate of mutagenesis across 
the entire genome have been identified; however, how the rate 
of evolution might be promoted in individual genes is unclear. 
Most genes in bacteria are encoded on the leading strand of 
replication’*. This presumably avoids the potentially detrimental 
head-on collisions that occur between the replication and transcrip- 
tion machineries when genes are encoded on the lagging strand’~*. 
Here we identify the ubiquitous (core) genes in Bacillus subtilis and 
determine that 17% of them are on the lagging strand. We find a 
higher rate of point mutations in the core genes on the lagging 
strand compared with those on the leading strand, with this difference 
being primarily in the amino-acid-changing (nonsynonymous) 
mutations. We determine that, overall, the genes under strong nega- 
tive selection against amino-acid-changing mutations tend to be on 
the leading strand, co-oriented with replication. In contrast, on the 
basis of the rate of convergent mutations, genes under positive 
selection for amino-acid-changing mutations are more commonly 
found on the lagging strand, indicating faster adaptive evolution in 
many genes in the head-on orientation. Increased gene length and 
gene expression amounts are positively correlated with the rate of 
accumulation of nonsynonymous mutations in the head-on genes, 
suggesting that the conflict between replication and transcription 
could bea driving force behind these mutations. Indeed, using rever- 
sion assays, we show that the difference in the rate of mutagenesis of 
genes in the two orientations is transcription dependent. Altogether, 
our findings indicate that head-on replication-transcription con- 
flicts are more mutagenic than co-directional conflicts and that 
these encounters can significantly increase adaptive structural vari- 
ation in the coded proteins. We propose that bacteria, and poten- 
tially other organisms, promote faster evolution of specific genes 
through orientation-dependent encounters between DNA replica- 
tion and transcription. 

Concurrent DNA replication and transcription leads to conflicts 
that stall replication, especially on the lagging strand, where the two 
machineries meet head-on (Supplementary Fig. 1)°*. Presumably, to 
avoid these encounters that can delay replication, bacteria have co- 
oriented most of their genes with replication by encoding them on the 
leading strand'*. However, it is unclear why, even in species with a 
strong orientation bias such as B. subtilis, 25% of all genes and 6% of all 
essential genes remain on the lagging strand’”. 

We investigated the relation between gene orientation and muta- 
genesis, by analysing the rate of mutations of ubiquitous (core) genes 
from five clonally divergent strains of B. subtilis (Supplementary Table 
1). To be defined as a core gene, the gene had to be present in all five 
strains, have at least 95% nucleotide identity and at least 95% of the 
length coverage, to exclude the highly diverse genes affected by non- 
homologous gene shuffling’. Among the 759 core genes, 132 (17%) 
were encoded on the lagging strand (Supplementary Table 2). Of the 
148 core genes that were determined to be essential'®, only six (4%) 
were on the lagging strand, consistent with the previously described 
strong orientation bias in the essential genes'”. 


We compared the rates of silent or synonymous (dS), and amino- 
acid-changing or nonsynonymous (dN) mutations, of the core genes 
on the leading and lagging strands. The rate of synonymous mutation 
on the lagging was marginally (2%) higher than on the leading strand 
(Fig. 1a). In contrast, the rate of nonsynonymous mutations of the 
lagging strand genes was 42% higher than the genes on the leading 
strand (Fig. 1b, P< 0.0001), without a significant number of outliers 
(Supplementary Fig. 2). The relative increase of dN in the genes on the 
lagging compared with the leading strand was irrespective of the dS 
amount of the corresponding genes (Supplementary Fig. 3). We found 
a similar trend when we analysed the mutagenesis rate in all leading- or 
lagging-strand genes: that is, besides the core genes, on the two strands 
(Supplementary Table 3). However, among the core genes that were 
previously identified as essential, we did not detect a difference in 
structural variability between the two strands (Supplementary Fig. 4). 

The observed difference between the rates at which leading- and lag- 
ging-strand genes vary may be due to orientation-dependent encounters 
between replication and transcription. It was previously suggested that 
transcript length contributes to severity of conflicts on the basis of the 
observation that gene operons oriented head-on tend to be of a rela- 
tively small size*. Interestingly, we found that most (82%) core genes on 
the lagging strand are not grouped together, and are organized as single 
genes rather than operons (Supplementary Fig. 5). Moreover, there is 
a significant bias for genes on the lagging strand to be significantly 
shorter on average, with the average gene size being 48% larger on the 
leading strand (681 compared with 459 base pairs). In particular, genes 
coding for proteins with a length longer than 200 amino acids are 
underrepresented on the lagging strand, with only 26% of genes (34 
of 132 genes) being of this size category, whereas 48% of genes exceed 
200 amino acids on the leading strand (Fig. 2a, P< 0.0001). These data 
are consistent with selection for genes on the lagging strand to be 
shorter, potentially to decrease the frequency of head-on conflicts. 
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Figure 1 | Highly conserved ‘core’ genes show a higher rate of 
nonsynonymous mutations on the lagging compared with the leading 
strand. The mean values for the core genes in relation to diversity are 
presented. The dS (a) and dN (b) values are plotted for the leading (grey) and 
lagging (black) strands. Error bars, s.e.m. A statistically significant difference in 
dN between the two strands is detected with *P < 0.0001. Analysis of statistical 
significance was performed using the Z-test for dS and dN values (see Methods). 
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Figure 2 | For head-on genes, a significant positive correlation exists 
between increased length, mutagenesis and positive selection. a, The 
percentage of genes in each size category for lagging (black) and leading (grey) 
strands in increasing windows of 50 amino acids. b, The fold difference between 
either dN (black) or dS (grey) for lagging compared with the leading-strand 
genes in seven size categories (R” = 0.65 for dN and 0.02 for dS, over length). 
c, dN/dS ratios for genes longer than 200 amino acids on the leading (grey) and 
lagging (black) strands (**P < 0.005). P values were determined using a 2 x 2 
xX test. 


Because gene size probably correlates with conflict level, we ana- 
lysed the relation between nonsynonymous mutation rates and gene 
length. Indeed, there was a significant positive correlation between 
increased length and increased dN on the lagging compared with 
the leading strand (Fig. 2b), with the relative difference in dN being 
most apparent among the genes coding for proteins longer than 200 
amino acids. In this size category (303 genes on the leading strand and 
34 genes on the lagging strand), we also find a positive correlation 
between transcript abundance and dN, with the mean expression 
values being threefold higher in the group with higher dN amounts 
compared with those with the lower dN amounts, only in the lagging- 
strand genes (Supplementary Fig. 6 and Supplementary Table 5). 
Together, the positive correlation between increased mutation rates 
and gene length, as well as expression levels, suggest that replication 
transcription conflicts are probably responsible for the increased 
mutation rates in the lagging-strand genes. 

We examined the ratio of dN to dS, to evaluate the relative strength 
of negative selection against structural changes in the genes coding 


LETTER 


Table 1 | Distribution of genes with convergent amino-acid mutations 
in lagging and leading strands 


Strand Number of genes* Genes with convergent 2x2? Pvalue 
amino-acid mutations 
Lagging 34 8 0.04 
Leading 303 34 


* Genes that encode proteins with length of 200 or more amino acids. 


proteins more than 200 amino acids in length. A significantly higher 
proportion of the genes on the leading strand (about half) are in the 
very low dN/dS range of less than 0.025 compared with those on the 
lagging strand (P = 0.004, Fig. 2c). Furthermore, there are 25 genes 
on the leading strand but none on the lagging strand that lack any 
structural variation at all. Thus, genes that are under especially strong 
selection against nonsynonymous changes are heavily biased to be 
co-oriented with replication. In contrast, almost half of the genes on 
the lagging strand had a relatively higher dN/dS range of more than 
0.05, significantly more than that on the leading strand (P = 0.002). To 
determine whether the increased dN/dS on the lagging-strand genes 
could be in part due to selection for nonsynonymous mutations, we 
looked for the presence of convergent amino-acid mutations, which is 
strong evidence for positive selection and adaptive evolution'’’. 
Among the genes encoding proteins longer than 200 amino acids, 
convergent mutations were detected in 24% of the core genes on the 
lagging strand in contrast to only 11% of the genes on the leading 
strand (P<0.04) (Table 1 and Supplementary Table 4), indicating 
faster adaptive evolution of the corresponding genes. 

We next determined whether replication-transcription conflicts are 
responsible for the increased mutation rates in the lagging-strand 
genes experimentally, using classic reversion assays. The positive cor- 
relation between increased gene length, as well as expression levels 
with increased dN in the lagging-strand genes, strongly suggest that 
conflicts are driving the mutagenesis of the head-on genes (Fig. 2b and 
Supplementary Fig. 6). We engineered strains auxotrophic for his- 
tidine biosynthesis, and integrated an ectopic copy of the histidine syn- 
thetase (hisC) gene with a premature stop codon at the 318th position, 
under the control of the isopropyl-p-thiogalactopyranoside (IPTG 
inducible promoter Pspankny) onto the chromosome, in either the 
head-on or co-directional orientation (Fig. 3). Reversion assays and 
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Figure 3 | Increased rates of mutagenesis in the head-on orientation are 
transcription dependent. a, The amyE locus containing reporter co- 
directionally (HM420) or head-on (HM419) to replication. b, Mutation rates 
on the basis of reversion assays for hisC952 when the gene was oriented head on 
or co-directionally to replication, in the presence (transcription on) or absence 
(transcription off) of IPTG (*P < 0.05). Error bars, 95% confidence intervals. 
We confirmed by sequencing that the transcription-dependent mutations 
were indeed occurring in the hisC allele at amyE (Supplementary Table 9). 
YB955 did not show an increase in hisC952 reversions with IPTG treatment 
(data not shown). 


28 MARCH 2013 | VOL 495 | NATURE | 513 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


a Cellular 
homeostasis 
8% 


Transcription 
regulation 


38% Iron ion 


Sporulation 
37% 


Figure 4 | Functional categories of leading- and lagging-strand genes. Pie 
charts presenting significantly (P < 0.05) overrepresented functional categories 
(as determined by medium stringency using DAVID”) of protein products 

from lagging (a) and leading (b) strands of replication for B. subtilis strain 168. 


fluctuation analyses were performed in the presence and absence of 
transcription, for both orientations (see Methods). In the absence of 
transcription, the orientation of the gene did not affect mutation rates 
(Fig. 3 and Supplementary Table 6). When activated, transcription 
led to an increase in the mutation rate in both orientations, but the 
increase was much more prominent for the head-on orientation (Fig. 3 
and Supplementary Table 6). The dependence of increased mutation 
rates on both transcription and orientation indicates that head-on 
replication-transcription conflicts lead to increased mutagenesis. These 
data demonstrate that, in the absence of transcription, mutagenesis 
rates of the two strands are not significantly different. 

We analysed the functional categories of the core genes on the 
lagging strand and found enrichment in genes for (1) sporulation, 
(2) iron binding, (3) transcription regulation and (4) cellular homeo- 
stasis (Fig. 4). In general, most of these genes represent a variety of 
stress responses. For example, the transcription regulation genes 
include extracytoplasmic function (ECF)-type sigma factors (SigV 
and SigM), which are known to be the most variable of the sigma 
factors and are involved in responding to extracytoplasmic functions, 
as well as master regulators of biofilm formation (SinI and SinR), 
which control the transition from a free-swimming lifestyle to a 
stress-tolerant, biofilm lifestyle. We propose that these particular 
functions are kept relatively variable, possibly to produce population 
heterogeneity that can more easily respond to rapid changes in the 
environment. 

Taken together, our data indicate that genes positioned on the lag- 
ging strand evolve at a significantly higher rate than those encoded on 
the leading strand, and that the mutagenic nature of the lagging strand 
is due, at least in part, to the conflicts between replication and tran- 
scription. The increased rate of mutagenesis in the genes on the lagging 
strand is probably detrimental for many, especially essential genes, and 
is probably responsible for the strong bias for these genes to be co- 
oriented with replication. Two models were proposed previously to 
explain the underlying cause of the bias against head-on orientation of 
genes: (1) the detrimental effect of increased mutagenesis, by Mirkin 
and Mirkin’; (2) increased rate of gene transcript truncations, by 
Rocha and Danchin’. Experimental work from B. subtilis, Escherichia 
coli and Saccharomyces cerevisiae has shown that genomic instability 
(which may increase the rate of mutations) is a potential consequence of 
head-on conflicts*’*"*. Indeed, our findings show that the bias for the 
co-directional orientation of genes in bacteria is at least partly deter- 
mined by negative selection against increased rate of nonsynonymous 
mutations, consistent with the Mirkin and Mirkin model. 
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The genes categorized for the lagging strand were abrB, acuA, acuB, ahpC, 
ahpF, antE, cotM, cspD, cueR, def, fer, gerPF, hpr, ispG, katA, lexA, mrgA, nasE, 
sacY, sda, sigM, sigV, sinI, sinR, spolISB, sspB, sspD, sspl, sspK, sspL, sspM, sspO, 
tnrA, ycnC, ydbP, ydgJ, yhdK, yjbl, ynzD, ypoP, ytzE, yutl, yuxN and ywoH. 


The other side of the coin, however, in contrast to the essential 
genes, is that the increased mutation rate could also be the reason 
for the head-on orientation of certain core genes: that is, head-on 
orientation could be promoted by positive selection. The action of 
positive selection is apparent from the high rate of convergent evolu- 
tion in the proteins coded by genes on the lagging strand—a strong 
indication of their adaptive significance. In contrast to the nonsy- 
nonymous changes, synonymous mutations are generally (but not 
always) considered to be fitness-neutral in nature and their accumula- 
tion over time is expected to be driven largely by genetic drift. Thus 
accumulation of synonymous mutations could be a relatively slow 
process compared with positive selection for the adaptive changes, 
possibly explaining the low level of accumulation of the synonymous 
mutations in the lagging-strand genes compared with the nonsynon- 
ymous mutations. Although it is possible that there is a stronger nega- 
tive selection against synonymous mutations in the lagging-strand 
genes owing to codon usage bias in highly expressed genes, we did 
not see a large difference between codon usage in the two strands’ 
genes (Supplementary Fig. 7). 

It is unclear how the phenomenon, described here, extends to other 
Gram-positive bacteria or to Gram-negative organisms such as E. coli 
and Salmonella. Orientation, transcription and rates of evolution in 
core genes have not been systematically investigated in these organisms, 
and previous studies looking at some aspects of these questions have 
produced contradictory results depending on methodology’””. 

Replication-transcription conflict-mediated mutagenesis could be 
used by many organisms, including eukaryotes, as a universal strategy 
to link gene expression and evolution rate under selection. Genes on the 
lagging strand can evolve at a relatively fast rate because of head-on 
conflicts. Thus a simple switch in orientation, using short sequence 
homologies and recombination-dependent mechanisms, could facili- 
tate evolution in specific genes in a targeted way. Investigating the main 
targets of conflict-mediated mutagenesis is likely to show far-reaching 
biological insights into adaptation and evolution of organisms. 


METHODS SUMMARY 


Strains are listed in Supplementary Tables 1 and 7. Plasmids are listed in Sup- 
plementary Table 8. Relevant properties of strains are described in the text. Strain 
constructions, growth conditions and bioinformatics analyses are described both in 
Methods and Supplementary Methods. Standard procedures were used for the 
reversion assays and calculation of mutation rates and are described (Methods). 
The rate of mutations (dS and dN) were determined on the basis of the possible 
number of changes compared with the actual number of changes observed, either 
for synonymous or nonsynonymous mutations (see Methods for details). 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Strains. For the bioinformatics analysis, fully assembled genomes of five B. subtilis 
strains were downloaded from NCBI GenBank. Those strains were B. subtilis 
subsp. subtilis str. 168, B. subtilis BSn5, B. subtilis subsp. subtilis str. RO-NN-1, 
B. subtilis subsp. spizizenii TU-B-10 and B. subtilis subsp. spizizenii str. W23 
(Supplementary Table 1). 

All experiments were performed using isogenic derivatives of Bacillus subtilis 
strain YB955 (ile, leu, his, met) (Supplementary Table 7). To construct a strain with 
an exogenous copy of histidine synthetase oriented head-on to replication, 
hisC952 was first amplified from genomic DNA from YB955 using the primers 
HM386 (AAA AGT CGA CAA GGA GGT ATA CAT TTG CGT ATC AAA GAA 
CAT TTA AAA C) and HM396 (AAA GCA TGC TCC TTA TGA ATG GGA 
CTT ATA AAA TTT CAG CTA AAA TGG). The PCR fragment was digested 
with SalI and SphI and ligated into the integration vector pDR111 to generate the 
plasmid pHM392. This plasmid was introduced by double crossover into YB955 at 
amyE to generate strain HM419 (leuC427 metB5 hisC952 amyE::{P spankinyy-hisC952 
Head-On spc}). To constructa strain with the hisC952 allele oriented co-directionally 
with replication, a fragment containing hisC952 and P.panknyy Was amplified off of 
pHM392 using primers HM392 (AAA GAA TTC TAA CTC ACA TTA ATT GCG 
TTG C) and HM393 (AAA GGA TCC TGG CAA GAA CGT TGC TCG AGG). 
This fragment was digested with EcoRI and BamHI, inverted and ligated into 
pDRI111 to generate plasmid pHM394. This plasmid was introduced by trans- 
formation into strain YB955, generating strain HM420 (leuC427 metB5 hisC952 
amyE::{Pspank(hy)-hisC952 Co-Directional spc}). See Supplementary Table 8 for a 
list of the plasmids. Candidate spectinomycin-resistant transformants were 
screened for disruption of the amyE locus by starch agar plate assay. Genomic 
DNA was isolated from amylase-deficient transformants and the chromosomal 
region containing amyE was amplified using primers HM205 (TCC AAA CTG 
GAC ACA TGG AA) and HM207 (AAA GAG GCG TAC TGC CTG AA). The 
resulting PCR fragments were sequenced to confirm integration of hisC952 in the 
proper orientation. 

Detection of common orthologous genes. To identify the core genes shared 
between all five strains of B. subtilis, we used B. subtilis subsp. subtilis str. 168 as 
the reference genome and extracted homologues from the other four strains with 
nucleotide sequence identity and length coverage values of at least 95%. We 
excluded the annotated pseudogenes and genes with either internal stop codons 
or non-ACGT characters. 

Identification of leading- and lagging-strand genes. We considered the origin of 
replication to be positioned between ribosomal protein L34 and chromosomal 
replication initiation protein (DnaA), whereas the terminus region before replica- 
tion terminator protein (Rtp) for all five strains. Depending on these locations, we 
defined the genes present in leading- and lagging strands for each strain. For 
analysis, we considered those genes that were present in the same strand (either 
leading or lagging) for all five strains. In total we found 759 core polymorphic 
genes: 627 in the leading strand and 132 in the lagging strand (Supplementary 
Table 2). 

Molecular evolutionary analysis. Alignment of every gene set was performed by 
clustalW~*. The rates of nonsynonymous (dN) and synonymous (dS) mutations 
for each gene were computed by the ratio of number of nonsynonymous and 
synonymous changes to the total number of nonsynonymous and synonymous 
sites respectively, using the mutation-fraction method”. To assess the statistical 
significance between any two sample sets, a Z-test was applied to dS and dN values”. 

Detection of convergent mutations was performed by Zonal Phylogeny soft- 
ware’’. Zonal Phylogeny reconstructs an unrooted protein phylogram on the basis 
of nucleotide sequences where several alleles differentiated by synonymous 
changes are collapsed into single protein variant, followed by the mapping of 
phylogenetically unlinked repeated mutations at specific amino-acid positions 


(that is, convergent amino-acid mutations). DnaSPv5 was used to calculate the 
2X27? statistics”. 

Media and growth conditions. For all experiments, cells were grown at 37 °C, 
shaking at 260 r.p.m. Liquid cultures of Bacillus subtilis were grown in rich med- 
ium (Luria-Bertain (LB)) supplemented with spectinomycin (50 pg ml ') where 
appropriate. E. coli was grown in LB supplemented with ampicillin at 100 ug ml’. 
To determine viabilities, B. subtilis cells were plated on solid agar plates containing 
Spizizen’s Minimal Medium (0.2 mg ml * ammonium sulphate, 1.4 mg ml” * mono- 
basic potassium phosphate, 0.6 mg ml dibasic potassium phosphate, 0.1 mg ml”! 
sodium citrate dihydrate, 0.02mgml~’ magnesium sulphate heptahydrate, 
100g ml’ glutamic acid, 5gml~' glucose), supplemented with isoleucine, 
methionine, leucine and histidine at 50 pg ml 1. To select for reversion to proto- 
trophy, cells were plated on solid agar plates containing Spizizen’s Minimal 
Medium supplemented with isoleucine, methionine and leucine at 50 tg ml’. 
Reversion assays. Fluctuation assays were performed to determine the mutation 
rate. Strains HM419 and HM420 were grown overnight in LB supplemented with 
spectinomycin at 50 jig ml’. Saturated cultures were diluted 1:10* into 2 ml LB 
medium and 2 ml LB medium supplemented with 1 mM IPTG. For these experi- 
ments, 12 parallel cultures for each strain and condition were grown for 5h at 
37 °C with shaking at 260 r.p.m. When cells reached saturation, 100 ll of culture 
was withdrawn, diluted 1:10° and spread onto solid agar plates containing 
Spizizen’s Minimal Medium supplemented with all required amino acids to deter- 
mine the number of viable cells present in the culture at plating. The remainder of 
the culture was pelleted by centrifugation, re-suspended in Spizizen’s Minimal 
Salts and spread onto solid agar plates lacking histidine. Revertant colonies were 
counted after 48 h of growth at 37 °C. The experiment was repeated three times on 
different days. 

Confirmation of reversions in the hisC gene at amyE. To determine whether 
reversions were occurring in the copy of hisC at amyE, 28 colonies, 24 head-on and 
four co-directional, from plates with IPTG were sequenced. The revertant colonies 
were grown up in selective (histidine— spectinomycin+) media and genomic 
DNA was prepared. The amyE region was amplified by PCR with primers 
HM205 (TCC AAA CTG GAC ACA TGG AA) and HM207 (AAA GAG GCG 
TAC TGC CTG AA). The PCR products were sequenced using the primer HM396 
(AAA GCA TGC TCC TTA TGA ATG GGA CTT ATA AAA TTT CAG CTA 
AAA TGG). Changes at position 952 were analysed (Supplementary Table 9). 
Calculation of mutation rates. Mutation rates were calculated using the Ma- 
Sandri-Sarkar maximum-likelihood method?**°, with the aid of the Fluctuation 
Analysis Calculator. Statistical significance was determined using a standard two- 
tailed t-test. 
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Embryonic stem (ES) cells are pluripotent and characterized by 
open chromatin and high transcription levels, achieved through 
auto-regulatory and feed-forward transcription factor loops’. ES- 
cell identity is maintained by a core of factors including Oct4 (also 
known as Pou5f1), Sox2, Klf4, c-Myc (OSKM) and Nanog”*, and 
forced expression of the OSKM factors can reprogram somatic cells 
into induced pluripotent stem cells (iPSCs) resembling ES cells**. 
These gene-specific factors for RNA-polymerase-II-mediated tran- 
scription recruit transcriptional cofactors and chromatin regula- 
tors that control access to and activity of the basal transcription 
machinery on gene promoters”*®. How the basal transcription 
machinery is involved in setting and maintaining the pluripotent 
state is unclear. Here we show that knockdown of the transcrip- 
tion factor IID (TFIID) complex affects the pluripotent circuitry in 
mouse ES cells and inhibits reprogramming of fibroblasts. TFIID 
subunits and the OSKM factors form a feed-forward loop to induce 
and maintain a stable transcription state. Notably, transient expres- 
sion of TFIID subunits greatly enhanced reprogramming. These 
results show that TFIID is critical for transcription-factor-mediated 
reprogramming. We anticipate that, by creating plasticity in gene 
expression programs, transcription complexes such as TFIID assist 
reprogramming into different cellular states. 

The basal transcription factor TFIID is central to transcription ini- 
tiation by RNA polymerase II (Pol II), and is composed of TATA- 
binding protein (TBP) and 13 TBP-associated factors (TAFs)°”®. 
TFIID can recognize both specific core promoter sequences as well 
as chromatin modifications associated with active promoters’. 
Interestingly, TFIID subunits and TBP paralogues can act in tissue- 
and development-specific ways”'®’*. We noted high protein levels of 
the TAF5 and TAF6 subunits of TFIID in mouse ES cells compared to 
NIH 373 cells or a fibroblast cell line (Fig. 1a). Many TAF messenger 
RNAs are also expressed at higher levels in ES cells (Supplementary 
Fig. 2). These TFIID subunits are also more abundant in human ES 
cells and iPSCs (Fig. 1b). To test the relevance of TFIID in mouse ES 
cells, we applied lentiviral-mediated knockdown and scored colonies 
for morphology and alkaline phosphatase staining. Notably, reduced 
expression of many TAFs (TAF1, 3, 4, 4b, 5, 6, 9, 11, 12 or 13) resulted 
in a differentiated morphology and reduced alkaline phosphatase 
staining (Fig. 1c and Supplementary Fig. 3). Two exceptions were 
TAF7 and 8, which despite efficient knockdown (Supplementary 
Fig. 4) did not affect ES-cell characteristics. Fluorescence-activated cell 
sorting (FACS) analysis of propidium-iodide-stained cells did not 
reveal apoptosis or strong proliferation defects after TAF knockdown 
(Supplementary Fig. 5). In contrast, reduced expression of Polr2a 
(Fig. 1d and Supplementary Fig. 6) or Pol II inhibition by «-amanitin 


(Fig. le) caused growth defects but failed to induce differentiation. 
Thus, high levels of TAFs rather than high Pol II activity are required 
to maintain ES-cell identity, suggesting that TFIID is directly involved 
in the pluripotent state. 
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Figure 1 | High expression of TFIID is required for maintaining ES cell 
identity. a, Immunoblot analyses of mouse ES cells (ES), NIH 3T3 cells (3T3), 
and a stable line of MEFs (FIB) based on equal cell numbers. Twofold dilutions 
of total cell lysates were loaded as indicated by the histone H3 control. 

b, Quantitative proteomic analyses of human pluripotent cells”. TAFs had an 
average standard deviation (s.d.) of 40% (n > 3). h, human. Values of human 
(h) iPSCs and ES cells are relative to IMR-90 fetal lung fibroblasts. 

c, Knockdown of TFIID subunits induces ES-cell differentiation. C, control 
knockdown (n > 36). d, Knockdown of Polr2a (Pol II) fails to induce ES-cell 
differentiation (P = 0.18 and 0.86, n > 36). AP, alkaline phosphatase. 

e, Inhibition of Pol II by «-amanitin does not induce differentiation (P = 1 in all 
cases, n > 9). f, Quantitative proteomic analysis of TAF6-GFP interactors was 
performed in duplicate with isotope swap. SILAC ratios are shown as black 
dots. **P< 0.01. 
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TFIID complexes can differ in composition depending on cellular 
state or cell type*’®. To determine TFIID composition in mouse ES 
cells, we applied stable isotope labelling by amino acids in cell culture 
(SILAC)-based quantitative proteomics“ to green fluorescent protein 
(GFP)-tagged TAF6 stably expressed at sub-endogenous levels (Sup- 
plementary Figs 7 and 8). Isotope ratios of 20 to 200 were observed for 
TBP and all TFIID TAFs (Fig. 1f and Supplementary Table 1). This 
shows that TFIID forms the canonical complex in mouse ES cells”"®. 

To examine TFIID function further, two independent stable TAF5 
or control knockdown ES cells were established. Quantitative poly- 
merase chain reaction with reverse transcription (RT-qPCR) analysis 
showed reduced mRNA expression of Nanog, Oct4 and Sox2, but 
enhanced expression of the differentiation genes brachyury and 
Bmp2 in stable TAF5-knockdown cells (Fig. 2a). TAF5-knockdown 
cells did not show changes in cell proliferation or cell cycle profiles 
(Supplementary Figs 9 and 10). Genome-wide mRNA profiles showed 
altered expression of 1,857 genes in TAF5-knockdown cells (using 
1.5-fold change with P< 0.05; Fig. 2b and Supplementary Fig. 11). 
Pluripotency genes were downregulated, whereas several neuroecto- 
dermal (engrailed 2, neuronatin 1, Gap43, nestin), mixed ectodermal/ 
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Figure 2 | TFIID is part of the pluripotency circuitry and is required for 
reprogramming. a, RT-qPCR analysis of two independent control 
knockdown (C) and TAF5-knockdown cells (n = 3). b, Genome-wide mRNA 
profiles of TAF5-knockdown versus control knockdown cells. Diagonals 
indicate 1.5-fold change (n = 2, in duplicate). c, ChIP analysis of the genes 
indicated or a control region using antibodies (Ab) to TAF1, TBP or 
haemagglutinin (HA) as a control (C; n = 3). d, Activity of the putative Taf4 
enhancer was tested in MEFs, control knockdown (C) or TAF5-knockdown 
(TAF5 KD) ES cells (n = 3). e, Quantification of OSKM-induced iPSC 
formation from OG2 MEFs under TAF-knockdown conditions based on ES- 
like morphology (n = 3). f, As e, but based on GFP* colonies. *P < 0.05, 
**P <).01. Error bars show s.d. 
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mesodermal (Pitx2, Hoxb8) and mesodermal (brachyury, Mest) dif- 
ferentiation markers were upregulated. No known endodermal genes 
were induced in TAF5-knockdown cells. Gene ontology analysis of 
downregulated genes in TAF5-knockdown cells showed enrichment 
for transcription regulators (Supplementary Fig. 12). Affected bio- 
logical processes included metabolism, development and signal trans- 
duction (Supplementary Fig. 13). Interestingly, mRNA profiles of 
TAF5-knockdown cells showed a high degree of overlap with Oct4- 
knockdown profiles! and Oct4 knockdown reduced expression of 
several TAFs including TAF4, 6 and 9 (Supplementary Figs 14 and 
15). To test the differentiation potential of TAF5-knockdown cells, 
leukaemia inhibitory factor (LIF) was removed from the cultures 
and mRNA expression was examined for selected genes. Whereas 
brachyury mRNA was induced to higher levels, Fgf5 induction was 
reduced on TAF5 knockdown. Similar to Oct4, Sox2 and Nanog, Tafl 
and Taf5 mRNAs were strongly reduced in ES cells on LIF removal 
(Supplementary Fig. 16). 

These results suggest that high levels of TFIID in ES cells are 
required to maintain expression of pluripotency but not of differen- 
tiation genes. This was tested in chromatin immunoprecipitation 
(ChIP)-qPCR assays for TAF1 and TBP. Promoters for Oct4, Nanog 
and Klf4 showed reduced occupancy of TAF1 upon TAF5 knockdown 
(Fig. 2c). In contrast, Bmp2 and engrailed-2 promoters showed 
increased TAF1 binding. ChIP assays for TBP gave similar results, 
but with a lower recovery. Next, we examined whether TFIID is 
directly involved in the pluripotent circuitry of ES cells. Published 
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Figure 3 | TFIID potentiates reprogramming to pluripotent cells. 

a, Quantification of OSKM- and OSK-induced iPSC formation from OG2 
MEFs with TFIID overexpression based on colony morphology (n = 3). b, As 
a, but based on GFP* colonies. c, Quantification of GEP* cells by FACS 
analysis (n = 3). d, Representative FACS profiles with intermediate and high 
GFP * cells enclosed by a box. Fluorescence was recorded at 530 nm (plotted on 
x-axis) and at 575 nm (y-axis) to distinguish GFP from auto-fluorescence. 

e, The intermediate GFP* population (P2) from OSKM plus TFIID-12 
reprogramming was obtained at 8 days after infection by FACS sorting and 
analysed again after 6 days of culture. The percentage of high GFP* cells is 
indicated. f, OSKM plus TFID-12 derived iPSCs are GFP, Sseal and alkaline 
phosphatase (AP) positive. iPS1, 4 and 5 indicate clone number. Original 
magnification, 200. **P < 0.01. Error bars show s.d. 
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ChIP sequencing (ChIP-seq) data’ from ES cells indicated binding of 
Nanog and Oct4 toa region ~10-kb upstream of the mouse Taf4 gene. 
This region contains a near-consensus Sox2/Oct4 site, and Nanog and 
Oct4 binding was confirmed by ChIP-qPCR (Supplementary Fig. 17). 
The upstream region was tested for enhancer activity in a luciferase- 
based assay. Figure 2d shows that this region acts as an ES-cell-specific 
enhancer, which is dependent on TAF5 expression. This indicates that 
TAF4 is included in the feed-forward regulatory circuitry in ES cells. 

Our experiments indicate a requirement for high TFIID levels to 
maintain the ES cell state. To test whether TFIID also has a role in 
cellular reprogramming, primary OG2 mouse embryonic fibroblasts 
(MEFs) were used, which express GFP under the control of the distal 
Oct4 enhancer and promoter'®. OSKM- mediated reprogramming under 
conditions of lentiviral-mediated TAF knockdown showed strongly 
reduced numbers of colonies with ES-cell-like morphology or GFP 
fluorescence (Fig. 2e, f). Importantly, transient knockdown (or over- 
expression, see later) of the tested TAFs did not cause obvious cell death 
or growth inhibition of MEFs (data not shown). Next, TFIID overex- 
pression was tested using two different pMX-based retroviral mixtures: 
TFIID-4, containing the core subunits TAF4, 5 and 6 with TBP; or 
TFIID-12, containing TFIID-4 with TAF3, 4b, 7, 9, 10, 11, 12 and 13. 
Inclusion of TFIID-12 during OSKM-mediated reprogramming of OG2 
MEFs resulted in ~30-fold enhancement in both the number of colonies 
with ES-cell-like morphology and with GFP fluorescence (Fig. 3a, b). 
TFIID-4 caused a ~20-fold increase in ES-cell-like/GFP* colonies. 
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Figure 4| TAF contributions to reprogramming and developmental 
potential of iPSCs. a, Quantification of iPSC colonies based on morphology. 
OSKM-induced reprogramming of OG2 MEFs was tested with TFIID-12, 
TFIID-11 (in which the indicated TAF was omitted), or single TAFs (n = 2). 
b, As a, but based on GFP fluorescence. ¢, Genotyping of 38 iPSC clones 
isolated after OSKM plus TFIID-12 reprogramming. Shown is the number of 
colonies lacking the indicated subunit. d, Reprogramming of human fibroblasts 
is strongly enhanced by co-expression of TAF4 but not empty vector (EV) or 
GFP. The number of colonies with ES-like morphology is indicated (n = 3). 
e, As d, but FACS analysis of Tral-60* cells (n = 3). f, Healthy male mice 
derived from tetraploid complementation with OG2-derived iPSC clone 1. 
*P< 0.05, **P< 0.01. Error bars show s.d. 
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Omission of c-Myc resulted in eight- and fourfold increases caused by 
TFIID-12 and TFIID-4, respectively. Expression of TFIID subunits in 
the absence of OSKM failed to induce any ES-cell-like colony. FACS 
analysis of GFP* cells showed that TFIID-12 and TFIID-4 strongly 
enhanced the number of both intermediate and high GFP* cells 
(Fig. 3c, d and Supplementary Fig. 18). Intermediate GFP* cells (P2 
population; Fig. 3e) of OSKM plus TFIID-12 reprogramming were iso- 
lated by FACS at 8 days after infection, plated for an additional 6 days 
and analysed again for GFP expression (Fig. 3e). This showed that 47% 
of the GFP-intermediate cells were converted into high GFP* cells, 
indicating that the intermediate population may represent unstable 
and partially reprogrammed cells of which many make it to the pluri- 
potent state. Infection of MEFs with TFIID-12 did not affect cell prolife- 
ration, as determined by BrdU incorporation (Supplementary Fig. 19). 

Three GFP* colonies from the OSKM plus TFIID-12 transduction 
were isolated for further characterization. All expressed the ES-cell mar- 
kers Sseal (also known as FUT4) and alkaline phosphatase (Fig. 3f), and 
contained proviral integrations for OSKM with distinct combinations of 
8-9 TFIID subunits (Supplementary Fig. 20). Proviral expression of 
OSKM and TFIID subunits (Supplementary Figs 21 and 22) was silenced 
in these clones. Bisulphite sequencing showed demethylation of the Oct4 
promoter (Supplementary Fig. 23). Genome-wide mRNA profiles of 
these iPSCs strongly resembled ES-cell profiles with high expression 
of OSKM and TFIID subunits (Supplementary Fig. 24 and Supplemen- 
tary Table 2). 

To obtain further insight into the mechanism of TFIID-stimulated 
reprogramming, individual TAF retroviruses were left out from the 
TFIID-12 mixture. Omission of TAF4 abolished the stimulatory effect 
of TFIID-12, as assessed by morphology and GFP fluorescence (Fig. 4a, 
b). TAF4 alone stimulated OSKM-mediated reprogramming of OG2 
MEFs (Fig. 4a, b) or adult fibroblasts (Supplementary Fig. 25), and 
TBP and TAF5 had moderate effects. Thirty-eight colonies obtained 
by OSKM plus TFIID-12 reprogramming were genotyped (Fig. 4c and 
Supplementary Fig. 26), which revealed integration of TAF4, 5, 6 and 7 
retroviruses in all iPSC clones. A genome-wide RNA interference 
screen for determinants of human ES cells identified TAF2, 7 and 12 
(ref. 17). Plotting the results of this study for all TFIID subunits indi- 
cates that the combined TAFs have an even more pronounced effect on 
human ES cells than a set of well-known pluripotency genes (Sup- 
plementary Fig. 27). Next, we found that TAF4 overexpression during 
reprogramming of human fibroblasts by limiting amounts of the 
OSKM virus strongly stimulated iPSC formation, as assessed by colony 
formation and Tral-60 staining (Fig. 4d, e and Supplementary Fig. 28). 
Together, these analyses indicate that TAF4 has a central role in iPSC 
formation from mouse as well as human fibroblasts. 

Pluripotency of OSKM plus TFIID-12 mouse iPSCs was demon- 
strated by teratoma formation (Supplementary Fig. 29). Next, OSKM 
plus TFIID-12 iPSCs were tested for germline transmission by tet- 
raploid complementation, in which diploid iPSCs are the (almost) 
exclusive source for the embryo. This resulted in normal development 
of five male mice from iPSC clone 1. Four mice were fully derived from 
iPSCs as judged from the black coat colour, and one showed a low level 
of chimaerism (Fig. 4f). Genotyping confirmed the iPSC-clone-1 ori- 
gin of these mice (Supplementary Fig. 30). After breeding with wild- 
type females, 12 fetuses were analysed at embryonic day 10.5 and all 
showed GFP* gonads (Supplementary Fig, 31). Together, these experi- 
ments demonstrate pluripotency of TFIID-generated iPSCs. 

Our findings reveal that high levels of TFIID are essential to induce 
and maintain the transcriptional program of pluripotent cells. It is 
unclear why pluripotent cells have this TFIID dependency. Possibly, 
the core promoter sequences of key pluripotency genes are inefficient 
binders of TFIID. Indeed, the Oct4, Sox2 and c-Myc promoters lack any 
TATA sequences and only K/f4 contains a TATA box (Supplementary 
Fig. 32). Reduced expression of TFIID by TAF5 or TAF3 (ref. 18) 
knockdown affected expression of pluripotency genes, but still sup- 
ported transcription of differentiation genes, which are enriched for 
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canonical TATA sequences”” (Fig. 2 and Supplementary Fig. 33). This 
agrees with studies of Taf7, Taf8 or Tafl0 gene disruptions, which 
indicate that these TAFs are particularly important in pluripotent cells 
before gastrulation and less important in differentiated and/or adult 
tissues!°?°?1, 

OSKM-induced reprogramming is inefficient®®, and retroviral co- 
expression of TFIID TAFs greatly stimulated reprogramming (Fig. 3). 
Proviral integrations were detected for only eight or nine TAFs, reveal- 
ing that not all TAFs were required during reprogramming. Partial and 
alternative TFIID complexes have been detected””°”” and may support 
certain transcriptional functions. The TAF4 virus is a common integ- 
rant and essential for TFIID-12-stimulated reprogramming (Fig. 4). 
TAF4 is a structural subunit in the core of TFIID” and an ES-cell- 
specific enhancer is located upstream of the mouse Taf4 gene. We 
speculate that TFIID collaborates with pluripotency factors in an 
auto-stimulatory loop in which high levels of TFIID drive pluripotency 
genes, which in their turn enhance expression of critical TAFs like 
TAF4 (Supplementary Fig. 1). This explains the synergism between 
OSKM and TFIID in reprogramming. 

Genome-wide knockdown screens identified TAFs, Mediator subu- 
nits and chromatin regulatory factors as critical for pluripotency'”??**””. 
Overexpression of Prmd14 (ref. 17) or Brgl and Baf155 (ref. 27) also 
stimulate iPSC formation by the gene-specific OSKM factors. TFIID 
may share a collective mechanism with these general regulators. We 
propose that these regulators enable fibroblasts to cross the epigenetic 
energy barriers for resetting gene expression of the pluripotent state*®. 
This predicts that the energetic costs of trans-differentiation induced by 
other gene-specific factors*** could also be met by overexpression of 
general regulators like TFIID. 


METHODS SUMMARY 


Short hairpin RNA (shRNA)-expressing puromycin-resistant lentiviruses were 
used for knockdown. Quantitative SILAC-based proteomics was performed on 
TAF6-GFP- expressing ES cells. Genome-wide mRNA expression analysis was 
performed using Agilent microarrays. ChIP-qPCR was used to analyse promoter 
occupancies of TFIID. Reprogramming assays were performed using OG2 MEFs 
and pMX-based retroviruses. Retroviral integration and silencing were analysed 
using qPCR. Human fibroblasts were reprogrammed using lentiviruses. Promoter 
methylation was analysed using bisulphite sequencing. Teratoma formation, blas- 
tocyst injections and gonad analysis were performed as described”’. 
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A syringe-like injection mechanism in Photorhabdus 


luminescens toxins 
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Photorhabdus luminescens is an insect pathogenic bacterium that 
is symbiotic with entomopathogenic nematodes’. On invasion of 
insect larvae, P. luminescens is released from the nematodes and 
kills the insect through the action of a variety of virulence factors 
including large tripartite ABC-type toxin complexes” (Tcs). Tcs 
are typically composed of TcA, TcB and TcC proteins and are 
biologically active only when complete**. Functioning as ADP- 
ribosyltransferases, TcC proteins were identified as the actual 
functional components that induce actin-clustering, defects in 
phagocytosis and cell death*’. However, little is known about the 
translocation of TcC into the cell by the TcA and TcB components. 
Here we show that TcA in P. Juminescens (TcdA1) forms a trans- 
membrane pore and report its structure in the prepore and pore 
state determined by cryoelectron microscopy. We find that the 
TcdAl prepore assembles as a pentamer forming an o-helical, 
vuvuzela-shaped channel less than 1.5 nanometres in diameter sur- 
rounded by a large outer shell. Membrane insertion is triggered not 
only at low pH as expected, but also at high pH, explaining Tc 
action directly through the midgut of insects*. Comparisons with 
structures of the TcdA1 pore inserted into a membrane and in 
complex with TcdB2 and TccC3 reveal large conformational 
changes during membrane insertion, suggesting a novel syringe- 
like mechanism of protein translocation. Our results demonstrate 
how ABC-type toxin complexes bridge a membrane to insert their 
lethal components into the cytoplasm of the host cell. We believe 
that the proposed mechanism is characteristic of the whole ABC- 
type toxin family. This explanation of toxin translocation is a step 
towards understanding the host-pathogen interaction and the com- 
plex life cycle of P.Juminescens and other pathogens, including 
human pathogenic bacteria, and serves as a strong foundation for 
the development of biopesticides. 

To understand ABC-type Tc proteins and, consequently, their 
mechanism of pore formation and toxin translocation, we used cryo- 
electron microscopy and single-particle analysis (SPA) to determine 
the structure of the TcdA1 complex at a resolution of 6.3 A (Fig. 1 and 
Supplementary Figs 1 and 2). TcdA1 forms a 17-nm-wide, 25.5-nm- 
long pentamer with a molecular mass of ~ 1.41 megadaltons (MDa) 
(Fig. la). The overall bell-like shape of the complex results from an 
inner, 25.5-nm-long pore that is, for the most part, surrounded by a 
large outer shell (Fig. 1b). Because the crystal structures of neither 
full TcA complexes nor subunits are known, we identified all clearly 
resolved rod-like electron densities and fitted poly-alanine helices into 
them (Fig. 1c, d). Contrary to previous suggestions’ that favoured a 
B-barrel structure as predicted for the pore-forming anthrax protective 
antigen"®, the channel shows a clear «-helical architecture with 25 central 
a-helices. In addition, our computational analysis identified five B-sheets 
in the upper part of the pore (Fig. 1c). Shaped like a vuvuzela (a flared 
funnel) that narrows from its maximum diameter of 3 nm to less than 
1.5 nm, the central pore is composed of an upper, cone-shaped vestibule 
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consisting of at least five f-sheets and five a-helices, and a long, 
tube-shaped channel formed by two consecutive sets of ten o-helices. 

Two large domains per subunit of the pentameric complex form the 
outer shell. Whereas one contains 26 clearly resolved «-helices, the 
other is dominated by -sheets (Fig. 1a, d). To locate these regions in 
the protein sequence, we performed a sequence-based secondary struc- 
ture prediction analysis of TcdA1 (Supplementary Fig. 3). We identified 
a large amino-terminal region spanning amino acids 1-1090, composed 
mainly of a-helices, followed by a large B-strand-dominated region 
(amino acids 1091-1720), which corresponds well to our structure. In 
accordance with our structure for the central pore, the carboxy-terminal 
region of the protein is composed of exceptionally long «-helices (1721- 
2320) and short terminal B-strands (2324-2516). Thus, each subunit of 
the TcdA1 pentamer is composed of an N-terminal o-helical domain, a 
central B-sheet domain and a C-terminal pore-forming domain (Fig. 1d 
and Supplementary Video 1). 

The exceptional length of the helices in the pore-forming domain (up 
to 90 amino acids) allowed us to assign their corresponding sequence to 
them precisely (Fig. le). We identified a series of four helices per pro- 
tomer (2020-2284) arranged as pairs of consecutive long and short 
helices that run antiparallel to each other to form the wall of the tube. 

In our structure, the central pore is closed not only by the loops 
connecting the pore-forming helices, but also by the outer shell that 
occludes the lower part of the pore. We therefore conclude that our 
structure represents the prepore state of TcdA1. To obtain the TcdA1 
complex in the pore state, we reconstituted the complex into liposomes 
(Fig. 2a) and determined its structure at a resolution of 19.9 A using 
cryoelectron microscopy and SPA (Fig. 2b-e and Supplementary 
Fig. 2). We note that the umbrella-like shape of the pore complex 
differs considerably from that of the bell-like prepore structure. 
Whereas the latter resembles the structure of Yen-Tc K::9 from 
Yersinia entomophaga’, the overall shape of the TcdA1 pore complex 
is similar to the anthrax and «-haemolysin toxin pores'’”’. 

To provide a better comparison between pore and prepore, we 
filtered the prepore structure to 19.9A and segmented both maps 
(Fig. 2f, g). When comparing both structures, very large rearrange- 
ments of the major domains between the pore and prepore structure 
become obvious (Fig. 2f, g and Supplementary Fig. 4a, b). The central 
pore is shifted by 115 A towards the lower end of the complex, and its 
lower tubular part penetrates the membrane. Although the resolution 
is low, the structure of the pore suggests that it is open at this position. 
This could be achieved by lateral movement of the transmembrane 
helices during integration into the membrane (Fig. 2g). 

The two domains of the outer shell are rotated and shifted relative to 
each other in a complex manner that results ina more compact subunit 
with greater distance to adjacent subunits (Supplementary Fig. 4a, b). 
Consequently, the outer shell has a larger diameter, is open at both 
sides and encloses mainly the funnel-like part of the central pore, 
forming a large basin (Supplementary Fig. 4c-e). 
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Figure 1 | Cryoelectron microscopy structure of the TcdA1 prepore 
complex. a, Electron density map of the TcdA1 prepore complex. b, Cut-away 
view displaying the central pore. The two domains of the outer shell are 
respectively depicted in green and brown, and the central pore is depicted in 
yellow. c, Fit of identified a-helices into the electron density of the central pore. 
At right are shown slices through the central pore at different heights. Asterisks 
indicate B-sheets. d, Extracted electron density of a protomer with identified 
a-helices. The outlined box shows the localization of the subunit in the 
complete structure. Densities corresponding to the N-terminal «-helical 
domain, the central B-sheet domain and the C-terminal pore-forming domain 
are depicted in green, brown and yellow, respectively. e, Sequence assignment 
to a-helices in the pore-forming domain. The four identified helices are 
depicted as numbered and coloured cylinders with their respective lengths. At 
bottom are shown the sequence-based secondary structure prediction for 
amino acids 2018-2284, the number of amino acids for each predicted helix 
and the expected lengths of the helices. Helices 1-4 include amino acids 2020- 
2105, 2109-2140, 2153-2244 and 2249-2285, respectively. Scale bars, 10 nm. 


Our data strongly suggest that TcdA1 is an o-helical pore-forming 
toxin (#-PFT). Whereas other PFT’ form the pore during membrane 
insertion’*"*, for TcdAl the pore is already present before mem- 
brane insertion, but is shielded by the outer shell of the protein. 
Surprisingly, although sequence-based hydrophobicity analyses of 
TcdAl predict a higher hydrophobicity for the helices inside the 
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Figure 2 | Cryoelectron microscopy structure of the TcdA1 pore and 
comparison with the prepore complex. a-c, Representative fields of 
negatively stained (a) and vitrified (b, c) TcdA1 complexes reconstituted in 
liposomes. Black and white arrowheads indicate molecules in the prepore and 
pore state, respectively. Scale bars, 20 nm. d, Representative class averages of the 
vitrified reconstituted complex. Scale bar, 10 nm. e, Cryoelectron microscopy 
structure of the TcdA1 pore depicted in side and cut-away views. Scale bar, 
10 nm. f, g, Segmentation of maps of the prepore (f) and pore (g) states; the two 
domains of the outer shell are respectively depicted in green and brown, and the 
central channel is depicted in yellow. The cut-away view in the inset in g shows 
that the transmembrane helices need to move laterally to fit into the density and 
to open the pore. Scale bar, 10 nm. 


membrane (Supplementary Fig. 5a), they do not detect clear trans- 
membrane regions as is the case for other structurally characterized 
a-PFTs’*. Structure-based calculations of the surface hydrophobicity 
potential, however, identify strongly hydrophobic stretches in the 
predicted transmembrane region (Supplementary Fig. 5b), indicating 
that the hydrophobicity is increased by the ordered assembly of per se 
moderately hydrophobic «-helices. 

In analogy to AB-type toxins, such as anthrax toxin'’, the Tc com- 
plex is thought to bind first to a receptor on the cell surface before being 
undergoing endocytosis*”. On acidification of the endosome, the pH 
shift induces the insertion of TcA into the endosomal membrane, 
which translocates either the C component or both the B component 
and the C component into the cytosol*’. However, in vitro, that is, 
without receptor and membranes, the prepore TcdA1 complex proved 
to be stable over a wide pH range (pH 4.5 to pH 10) (Supplementary 
Fig. 6) and insertion into preformed vesicles was difficult, although not 
impossible. For more extreme pH values, such as pH 4, we observed 
many aggregates, but many of the non-aggregated complexes resembled 
the pore state of the complex (Supplementary Fig. 6a). Surprisingly, a 
shift to high pH values, such as pH 11, also induced pore formation 
of TcdA1 in vitro (Supplementary Fig. 6e). Depending on protein 
concentration, we also observed monomers, dimers and tetramers of 
pentamers (Supplementary Fig. 6e). This effect is probably caused by 
the interaction of the exposed hydrophobic transmembrane regions 
and was also observed in other pore-forming toxins’®. 

Comparing the structure of this complex obtained by negative-stain 
electron microscopy and SPA with the negative-stain electron micro- 
scopy structure of the prepore complex (Fig. 3a, b) and with the recon- 
stituted complex in liposomes (Fig. 2e) proves that the complex indeed 
represents the pore state of TcdA1. The observed extreme pH values 
needed for the transition probably result from the absence of the 
receptor. Nevertheless, our experiments show that a pH shift induces 
the transition from the prepore state of TcdA1 to the pore state 
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Figure 3 | Structure of the PTC3 holotoxin complex. Side views and central 
cuts of negatively stained TcdA1 and the PTC3 holotoxin complex in the 
prepore and pore states. a, Prepore TcdA1; b, pore TcdA1; ¢, prepore PT'C3 
holotoxin complex; d, pore PT'C3 holotoxin complex. TcdA1 and TcdB2- 
TccC3 are shown in khaki and orange, respectively. Scale bar, 10 nm. 
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(Supplementary Fig. 7a—d and Supplementary Video 2). As described 
for many G-protein-coupled receptors, lowering the pH, with resulting 
protonation of residues, very probably destroys ionic as well as elec- 
trostatic interactions and thereby causes conformational changes’’. 
Our finding that a shift to higher pH values results in the same trans- 
ition indicates that deprotonation of residues, and not only their pro- 
tonation, can release the ionic or electrostatic lock that constrains the 
TcdA1 complex in its prepore conformation. 

In accordance with our observations for TcdA1 reconstitution in 
liposomes, lipid bilayer experiments demonstrated that TcdA1 formed 
ion-permeable channels over a large pH range (pH 4 to pH 11), exhi- 
biting a well-defined single-channel conductance (Supplementary 
Fig. 8a-c). As expected from our electron microscopy studies, the 
pore-forming activity of TcdA1 was higher at extreme pH values. At 
pH 11, the pore-forming activity of TcdAl was more than 100-fold 
greater than at pH 6. This probably explains why Tc toxins also directly 
intoxicate the midgut*, which is highly alkaline in many insects’. 
However, in nature the nematodes release P. luminescens only in the 
haemocoel’’, where it first colonizes midgut cells, which presumably 
endocytose the lethal bacterial toxins”. Although it is tempting to 
speculate that the ability to form a pore at both low and high pH values 
would allow Tc toxins to intoxicate midgut cells in two ways, from the 
haemocoel and at the same time from inside the gut, the process in 
nematodes suggests that this mechanism might not be used in nature. 

We measured the TcdA1 pore when reconstituted into a bilayer and, 
in line with our cryoelectron microscopy structure, found that the 
TcdAl pore is funnel shaped with an entrance diameter of 3.0nm 
and a minimum diameter of about 1.6nm (Supplementary Figs 8d 
and 9 and Supplementary Table 1). This result corroborates our 
structural data, showing that the central pore of TcdA1 is already 
formed in the prepore state and does not change much during mem- 
brane insertion. 

To determine where the B and C subunits bind to the A subunit, we 
first assembled the B and C subunits and then assembled the full PTC3 
holotoxin, composed of TcdA1, TcdB2 and TccC3, and determined 
their structures by electron microscopy and SPA (Fig. 3c, d, Supplemen- 
tary Fig. 7e-h and 10). The TcdB2 and TccC3 subunits form a strong, 
dimeric, oval-shaped complex (Supplementary Fig. 10). Corresponding 
well with biochemical data of the Tc complex from Xenorhabdus 
nematophilus*, only one B-C complex rather than five binds to a 
TcdA1 pentamer directly interacting with the funnel-shaped vestibule 
of the TcdA1 channel (Fig. 3c). pH-induced pore formation of TcdA1 
as described above results in the insertion of the B—C complex into the 
basin formed by the outer shell and the funnel of the inner channel 
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Figure 4 | Syringe-like mechanism for membrane insertion. Model for 
membrane insertion of the PTC3 complex and translocation of the B—C 
complex. The two domains of the outer shell are respectively depicted in green 
and brown, and the central pore of TcdA1 is depicted in yellow. The dimeric 
complex of TcdB2 and TccC3 is depicted in orange. The beige bar indicates the 
membrane bilayer. a, The PTC3 complex is either taken up orally or secreted 
into the haemocoel, where it binds to a yet unknown receptor and is 
endocytosed. b, c, Lowering the pH in the endosome or increasing the pH in the 
insect gut destroys an ionic and/or electrostatic lock in the outer shell of the 
TcdA1 pentamer. The outer shell is then structurally reorganized. As a result, it 
widens and releases the central pore (b), which penetrates the membrane like a 
syringe (c). The pore opens and at the same time the B-C complex is pulled into 
the basin formed by the funnel of the central pore and the outer shell of the TcdA1 
complex. d, There, it is unfolded and transported through the central pore. 


(Fig. 3d). However, we did not observe its partial or full translocation 
through the TcdA1 pore, which probably requires a proton or other 
electrochemical gradient across the membrane, and/or chaperones. 

The information provided by these structures allows us to propose 
the following mechanism for membrane insertion of TcdA1 and pro- 
tein translocation. First the fully assembled holotoxin composed of 
TcdAl, TcdB2 and TccC3 is either taken up orally or secreted into 
the haemocoel of the insect, where it binds as a prepore to a yet 
unknown receptor at the cell surface and is endocytosed (Fig. 4a). At 
that time, the pore is closed and shielded by the outer shell of the 
protein. A pH-sensitive ionic and/or electrostatic lock opens on acidi- 
fication of the endosome or on alkalization of the insect gut (Fig. 4b). 
As a result, the outer shell widens and releases the central pore, which 
penetrates the membrane like a syringe (Fig. 4c). The pore opens and 
at the same time the B-C complex is pulled into the basin formed 
by the central pore and the outer shell, inducing its unfolding and 
translocation (Fig. 4d and Supplementary Video 3). The structural 
data on TcdA1 and PTC3 described here reveal that ABC-type toxins 
are exceptional o-PFTs that insert into membranes using a unique 
syringe-like mechanism. Hence, it serves as a strong foundation for 
understanding the action of other pathogens, including human patho- 
genic bacteria. 


METHODS SUMMARY 


TcdA1 and TcdB2-TccC3 were recombinantly expressed in Escherichia coli BL21- 
CodonPlus and purified using Ni-IDA affinity as well as size-exclusion chromato- 
graphy. Membrane insertion and pore activity was measured using black lipid 
membranes composed of diphytanoyl phosphatidylcholine. For electron micro- 
scopy, the samples were applied to carbon-coated or holey-carbon grids either 
stained with 0.07% uranyl formate or vitrified. Negatively stained complexes were 
imaged with a JEOL JEM-1400 electron microscope equipped with a LaBg cathode 
operated at 120kV. Digital electron micrographs were recorded with a 4k X 4k 
CMOS camera F416 (TVIPS) using minimal-dose conditions. Images of vitrified 
complexes were made with a JEOL JEM 3200FSC electron microscope equipped 
with a field-emission gun at an operating voltage of 200 kV. An in-column omega 
energy filter was used to improve image contrast by zero-loss filtering with a slit 
width of 15eV. Micrographs were recorded with an 8k X 8k TemCam-F816 
CMOS camera (TVIPS) under minimal-dose conditions (15-20 electrons per 
A’). The SPARX software”! was used for all image-processing steps with the excep- 
tion of the initial defocus determination, which was done using CTFFIND3” (Sup- 
plementary Figs 2, 11-14). To identify a-helices, we used the programs PHENIX” 
and VOLTRAC™. For visualization and analysis, we used CHIMERA and imple- 
mented programs~*. Sequence-based secondary structure analysis was carried out 
using the SABLE protein structure prediction server (http://sable.cchmc.org/). A 
detailed description of our methods can be found in Supplementary Information. 
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Critical role of Tribl in differentiation of 
tissue-resident M2-like macrophages 


Takashi Satoh!?, Hiroyasu Kidoya®, Hisamichi Naito®, Masahiro Yamamoto*?, Naoki Takemura’, Katsuhiro Nakagawa’, 
Yoshichika Yoshioka®, Eiichi Morii’, Nobuyuki Takakura*, Osamu Takeuchi!*® & Shizuo Akiral? 


Macrophages consist of at least two subgroups, M1 and M2 (refs 1-3). 
Whereas M1 macrophages are proinflammatory and have a central 
role in host defence against bacterial and viral infections**®, M2 
macrophages are associated with responses to anti-inflammatory 
reactions, helminth infection, tissue remodelling, fibrosis and 
tumour progression’. Trib] is an adaptor protein involved in protein 
degradation by interacting with COP1 ubiquitin ligase’. Genome- 
wide association studies in humans have implicated TRIB1 in lipid 
metabolism* ”’. Here we show that Trib] is critical for the differen- 
tiation of F4/80*MR* tissue-resident macrophages—that share 
characteristics with M2 macrophages (which we term M2-like macro- 
phages)—and eosinophils but not for the differentiation of M1 mye- 
loid cells. Trib1 deficiency results in a severe reduction of M2-like 
macrophages in various organs, including bone marrow, spleen, 
lung and adipose tissues. Aberrant expression of C/EBPa in Trib1- 
deficient bone marrow cells is responsible for the defects in 
macrophage differentiation. Unexpectedly, mice lacking Trib1 in 
haematopoietic cells show diminished adipose tissue mass accom- 
panied by evidence of increased lipolysis, even when fed a normal 
diet. Supplementation of M2-like macrophages rescues the patho- 
physiology, indicating that a lack of these macrophages is the cause 
of lipolysis. In response to a high-fat diet, mice lacking Trib1 in 
haematopoietic cells develop hypertriglyceridaemia and insulin 
resistance, together with increased proinflammatory cytokine gene 
induction. Collectively, these results demonstrate that Trib] is criti- 
cal for adipose tissue maintenance and suppression of metabolic 
disorders by controlling the differentiation of tissue-resident M2- 
like macrophages. 

Members of the tribble family are pseudokinase proteins that are 
conserved among species and implicated in various human diseases, such 
as leukaemia and metabolic disorders. Tribble proteins interact with an 
E3 ubiquitin ligase, COP1, for protein degradation. In thioglycollate- 
elicited macrophages, Tribl is important for interleukin (IL)-12p40 
production to lipopolysaccharide"’. In addition, Trib1 and Trib2 have 
been implicated in acute myeloid leukaemia'*’’. Trib3 can inhibit 
insulin signalling", although Trib3‘~ mice do not show defects in in- 
sulin signalling or glucose homeostasis’*. However, the roles of tribble 
proteins in haematopoietic cell development have not been clarified. 

First, we examined the populations of tissue-resident macrophages 
in mice lacking tribble family genes in the spleen. Although the pro- 
portions of B cells, T cells, dendritic cells and Ly6C™*Macl * inflam- 
matory monocytes were not altered between wild-type and Tribl~'~ 
splenocytes, F4/80* Macl* macrophages, which also expressed Mrcl 
(also called MR), Arg1 and Fizz1 (also called Retnla), were markedly 
reduced and Siglec-F*CCR3* eosinophils were absent in Tribl~'~ 
spleens (Fig. la and Supplementary Figs 1 and 2). In contrast, the 
neutrophil population was increased in Tribl~'~ spleens (Fig. 1a and 


Supplementary Fig. 3). Splenic F4/80* red pulp macrophages phago- 
cytose aged red blood cells and accumulate iron'®’”. Immunohisto- 
chemical staining of spleen sections confirmed the absence of red pulp 
macrophages in Trib] ‘~ spleen (Fig. 1b). Furthermore, Perl’s Prussian 
blue staining of the spleen sections revealed that iron did not accu- 
mulate in Tribl~/~ mice (Fig. 1cand Supplementary Fig. 4). In contrast, 
splenic metallophilic and marginal zone macrophages were normal 
in Tribl-‘~ mice (Supplementary Fig. 5). In addition to the spleen, 
tissue-resident macrophages in other tissues were severely decreased in 
Trib1~'~ mice; however, peritoneal resident macrophages were com- 
parable between wild-type and Trib1~‘~ mice (Supplementary Fig. 6). 
Newly generated Trib2-‘~ and Trib3-‘~ mice did not show any 
defects in myeloid and lymphoid cells in the spleen (Supplementary 
Figs 7 and 8). Because MR, Argl and Fizzl expression is a hallmark 
characteristic of M2 macrophages, we termed this population M2-like 
macrophages. Thus, these findings indicate that Trib] is critical for the 
differentiation of tissue-resident M2-like macrophages and eosino- 
phils in the peripheral organs. 

Differentiation of haematopoietic cells, including macrophages, 
occurs in the bone marrow, followed by their migration to peripheral 
tissues via the bloodstream. Consistent with the defects observed in 
peripheral organs, numbers of F4/80* Macl “ cells and Siglec-F* CCR3* 
eosinophils were severely decreased in the blood and bone marrow 
cells from Tribl~'~ mice whereas numbers of Gr-1™£" neutrophils 
were slightly increased (Fig. 1d, e). However, inflammatory monocytes 
were comparable between wild-type and Trib1~'~ bone marrow cells 
(Supplementary Fig. 9). The adoptive transfer of Tribl~/~ bone 
marrow cells to wild-type mice failed to increase F4/80* macrophage 
numbers in the bone marrow and spleen (data not shown). Further- 
more, competitive transfer of CD45.1* wild-type and CD45.2* 
Tribl‘— bone marrow cells (1:1 ratio) to sublethal-irradiated wild- 
type mice led to severely impaired development of CD45.2* macro- 
phages and eosinophils, and increased the population of neutrophils 
(Supplementary Fig. 10). These findings demonstrate that the defects 
observed in Trib1~’~ mice are intrinsic to haematopoietic cells, and 
that Trib] is critical for regulating the proper differentiation of myeloid 
cells in the bone marrow. To delineate the developmental competency 
of Trib1~‘~ bone marrow cells, we performed colony-forming assays. 
Whereas granulocyte/neutrophil colonies were increased, macrophage 
colonies were severely decreased and eosinophil colonies were not 
generated in Tribl~’~ bone marrow cells compared with wild-type 
cells (Fig. 2a). We found that macrophage colonies could be morpho- 
logically classified into two subgroups, namely aggregated/small and 
diffused/large macrophages (Fig. 2b). Although most wild-type macro- 
phage colonies were aggregated and small, the macrophage colonies 
obtained from Tribl’"” bone marrow cells were diffused and large 
(Fig. 2b). These results indicated that Trib1 is essential for the proper 
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Figure 1 | Lack of tissue-resident M2-like macrophages and eosinophils in 
Trib1-deficient mice. a, Flow cytometric analyses of splenocytes. The 
expression levels of F4/80* and Mac] ~ tissue-resident M2-like macrophages in 
splenocytes are shown (top). The proportions of inflammatory monocytes and 
neutrophils (middle) and eosinophils (bottom) in splenocytes are also shown. 
Similar results were obtained in eight independent experiments. b, Wild-type 
(WT) and Trib1 ‘ spleen sections were stained for F4/80 (red) and B220 
(green). c, Images of Perl’s Prussian blue staining for ferric iron in the spleens of 
wild-type and Tribl_‘~ mice. Scale bars, 50 jum. d, Flow cytometric analysis of 
cell populations in the blood. The percentages of F4/80~ and Macl “ cells in the 
blood are shown. e, Flow cytometric analysis of bone marrow. The percentages 
of F4/80* and Mac” tissue-resident M2-like macrophages in the bone 
matrow are shown (top). The proportions of inflammatory monocytes and 
neutrophils (middle) and eosinophils (bottom) in the bone marrow are also 
shown. Similar results were obtained in three independent experiments (b-e). 


differentiation of bone marrow myeloid cells. Proportions of CMP and 
CLP as well as GMP were comparable between wild-type and Tribl /~ 
mice (see Supplementary Fig. 11 for definitions). Furthermore, the 
expression pattern of transcriptional factors, which are involved in 
myeloid cell differentiation, in GMP and MDP was also comparable 
between wild-type and Trib1~‘~ mice (Supplementary Fig. 12), sug- 
gesting that Tribl controls myeloid cell differentiation downstream 
of GMP. 

Retroviral expression of full-length Trib] in Tribl"~ bone marrow 
cells resulted in a marked increase in aggregated and small macro- 
phage colonies and eosinophil colonies, but decreased granulocyte/ 


Figure 2 | Trib1 controls macrophage, eosinophil and neutrophil 
differentiation via the ubiquitin-binding domain. a, b, Bone marrow cells 
were subjected to colony-forming assays. Each colony was counted depending 
on the morphology (a), and images of the cell types of macrophage colonies are 
shown (b). Eo, eosinophil colonies; G, granulocyte/neutrophil colonies; GM, 
granulocyte/macrophage colonies; M, macrophage colonies. c-e, Role of the 
COP1-binding site in macrophage differentiation. Bone marrow cells lacking 
Trib1 were infected with retroviruses expressing empty vector, full-length Trib1 
(mTrib1) or Tribl(ADQIVPE) mutant. Colony-forming assays were 
performed in retrovirus-infected (GFP*) cells. Images of the retrovirus- 
infected (GFP *) cell types of macrophage colonies are shown in c. d, Semi- 
quantitative PCR analyses were performed to measure the expression of Trib1 
in empty vector, full-length Trib1 (full length) or mutant Trib1(ADQIVPE) in 
macrophage colonies. e, Quantitative PCR analyses were performed for MR 
expression in macrophage colonies expressing empty vector, Trib1 (full length) 
or mutant Trib1(ADQIVPE). Error bars indicate s.d. of duplicates. Similar 
results were obtained in three independent experiments (a-e). f, Immunoblot 
analyses of Mario: C/EBP family molecules. Lineage-negative fractions from 
wild-type, Tribl ‘ cells, Tribl”"” cells expressing full-length and Trib1~ 
cells expressing mutant Tribl(ADQIVPE) were analysed by western blotting. 
g, h, Role of C/EBP« in macrophage differentiation. Bone marrow cells lacking 
Trib1 were infected with retroviruses expressing empty vector, Cebpa shRNA 8 
or Cebpa shRNA 12. Immunoblot analyses of various C/EBP family molecules. 
Lineage-negative fractions from wild-type, Tribl”'~ cells and Trib1 ‘~ cells 
expressing Cebpa shRNA 8 or Cebpa shRNA 12 were analysed by western 
blotting (g). Colony-forming assays were performed in infected (GFP *) cells, 
and images of the retrovirus-infected (GFP *) cell types of macrophage colonies 
are shown (h). Similar results were obtained in two independent experiments 


(£-h). 


neutrophil colonies (Fig. 2c, d and Supplementary Fig. 13). In contrast, 
expression of Tribl lacking the COP1-binding site (Trib1(ADQIVPE) 
mutant) in Trib1~'~ bone marrow cells failed to restore the differenti- 
ation defects. Moreover, expression of full-length Trib1, but not of 
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Tribl1(ADQIVPE) mutant, restored expression of M2 macrophage 
marker genes such as MR (Fig. 2e). Next, we investigated the express- 
ion of potential target proteins of the Tribl1-COP1 axis in Trib1~'~ 
bone marrow cells. Among the transcription factors, C/EBP family 
members are important for determining the balance between granu- 
lopoiesis and monopoiesis’®. We found that expression of C/EBPa, 
but not other transcriptional factors involved in myeloid cell differ- 
entiation, was increased in lineage-negative bone marrow cells and 


macrophage colonies lacking Trib1 (Fig. 2f and Supplementary Fig. 14). 
The expression of full-length Tribl in Trib1~‘~ bone marrow cells 
suppressed the level of C/EBPa expression (Fig. 2f). To address 
whether increased expression of C/EBP« is involved in aberrant mye- 
loid cell differentiation under Trib! deficiency, we genetically inhibited 
Cebpa in Tribl~‘~ bone marrow cells using short hairpin RNAs 
(shRNAs) (Supplementary Fig. 15). The decreased C/EBPo protein 
expression in Tribl~‘~ bone marrow cells was similar to levels of 
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Figure 3 | Impact of Trib1 deficiency on lipodystrophy. a, b, Flow 
cytometric analyses of haematopoietic cells in adipose tissue. The expression 
levels of F4/80° and Macl* or MR™ and Macl* tissue-resident M2 
macrophages in adipose tissue (left and centre) and Siglec-F* and CCR3* 
eosinophils (right) are shown. The total numbers of MR* macrophages and 
eosinophils in adipose tissues from wild-type and Tribl'~ chimaeric mice fed 
a normal diet are shown (b). Error bars indicate s.d. of duplicates. Similar 
results were obtained in three independent experiments. c, Expression levels of 
Tribl, Trib2 and Trib3 in SVFs and MAFs, assessed by quantitative PCR. 
Similar results were obtained in two independent experiments. d, The 
epididymal adipose tissues from wild-type and Tribl ‘~ chimaeric mice fed a 
normal diet were analysed by magnetic resonance imaging (MRI). e, Images of 
the epididymal adipose tissues from each indicated chimaeric mice fed a 
normal diet are shown. f, Haematoxylin and eosin-stained images of paraffin- 
embedded epididymal adipose tissue sections from each indicated chimaeric 
mice fed a normal diet. Scale bars, 50 tm. g, h, The total epididymal fat weight 
was measured (g) and each area of adipocytes was measured (h) in each 
indicated chimaeric mouse. Error bars indicate s.d. of duplicates. i, Images of 
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the epididymal adipose tissues from each indicated chimaeric mice fed a 
normal diet are shown. j, Haematoxylin-and-eosin-stained images of paraffin- 
embedded epididymal adipose tissue sections from each indicated chimaeric 
mice fed a normal diet. Scale bars, 50 um. k, 1, The total epididymal fat weight 
(k) and each area of adipocytes were measured (1) in each indicated chimaeric 
mouse. Error bars indicate s.d. of duplicates. m, Total RNA was prepared from 
wild-type and Trib1 ‘~ chimaeric mice fed a normal diet. The expression levels 
of mRNAs encoding the indicated proteins were determined by quantitative 
PCR. Error bars indicate s.d. of duplicates. Similar results were obtained in three 
independent experiments (e-m). n, Lipolysis assays in wild-type and Tribl‘~ 
mice fed a normal diet. The serum glycerol and NEFA levels were measured in 
vivo. Similar results were obtained in two independent experiments. 0, Total 
RNA was prepared from wild-type and Tribl © chimaeric mice fed a normal 
diet. The expression levels of mRNAs encoding the indicated proteins were 
determined by quantitative PCR. Error bars indicate s.d. of duplicates. Similar 
results were obtained in three independent experiments. Statistical significance 
in b, g, h, k, 1, n, o was determined using the Student’s t-test. *P < 0.05; 
**P< 0.01. 


Limited. All rights reserved 


wild-type cells (Fig. 2g). Repression of Cebpa resulted in an increase 
in the aggregated and small macrophage colonies and eosinophil 
colonies, but a decrease in granulocyte/neutrophil colonies (Fig. 2h). 
Taken together, these data demonstrate that Tribl regulates myeloid 
cell differentiation by altering the expression of C/EBPa in a COP1- 
dependent manner. 

Recent genome-wide association studies revealed that genetic var- 
iants in the loci corresponding to TRIB] are associated with increased 
plasma lipoprotein concentrations, a high risk of ischaemic heart 
disease and myocardial infarction in humans*"°. These studies prom- 
pted us to investigate the role of Trib] in the adipose tissue and liver of 
mice. Because we did not observe histological changes in the liver in the 
absence of Tribl in mice fed normal chow (Supplementary Fig. 16), 
we next examined the adipose tissues. Consistent with observations 
in other organs, MR‘ F4/80* adipose-tissue-resident M2-like macro- 
phages were severely decreased in the stromal vascular fraction (SVF) 
derived from Trib1~/~ epididymal adipose tissues compared with 
wild-type mice (Supplementary Fig. 17). Mice lacking Trib1 in haema- 
topoietic cells also showed a reduced number of M2-like macrophages 
and lacked eosinophils in epididymal adipose tissues (Fig. 3a, b), and 
Tribl mRNA was rarely expressed in the normal mature adipocyte 
fraction (MAF) (Fig. 3c), indicating that the defect is intrinsic to 
haematopoietic cells. In contrast, CD11c*Macl* M1 macrophages 
were hardly present in wild-type and Tribl~’~ adipose tissues from 
mice fed a normal chow diet (Supplementary Fig. 17). Unexpectedly, 
MRI analyses revealed that Tribl deficiency in haematopoietic cells 
severely reduced the abdominal adipose tissues (Fig. 3d). The epidi- 
dymal adipose tissues and size of each epididymal adipocyte were 
significantly smaller in mice lacking Trib] in all tissues as well as in 
haematopoietic cells only, than in littermate wild-type mice fed a 
normal diet (Fig. 3e-h). Thus, the mice showed a lipodystrophic 
phenotype. In contrast, Trib1~'~ mice harbouring wild-type haema- 
topoietic cells did not develop lipodystrophy (Fig. 3e, f). These findings 
demonstrate that Trib1 deficiency in haematopoietic cells is respons- 
ible for development of this pathophysiology. To investigate whether 
loss of M2-like macrophages is responsible for the development of 
lipodystrophy, we supplemented Trib1~'~ mice with these macro- 
phages (Supplementary Fig. 18). After 3 weeks, the epididymal adipose 
tissues and the size of each adipocyte from Tribl ‘~ mice reconsti- 
tuted with these macrophages were significantly larger than in PBS- 
treated Tribl~'~ mice (Fig. 3i-1). Collectively, these data indicate that 
lack of M2-like macrophages, caused by Trib1 deficiency, is important 
for the development of the lipodystrophic phenotype. 

Next, we examined the mechanisms of the lipodystrophy caused by 
Tribl deficiency in haematopoietic cells. Adipose tissues are main- 
tained by balance between lipogenesis and lipolysis. The mRNA levels 
of genes involved in adipocyte differentiation and lipogenesis were 
comparable between wild-type and Trib1~'~ bone marrow chimaeric 
mice (Fig. 3m and Supplementary Fig. 19; see Methods). In contrast, 
the serum levels of non-esterified fatty acids (NEFAs) and glycerol 
were significantly elevated in Trib1~'~ bone marrow chimaeric mice 
(Fig. 3n), indicative of enhanced lipolysis caused by Trib1 deficiency. 
Cytokines produced by macrophages contribute to adipose tissue in- 
flammation and lipolysis. Expression of IL-10 was severely reduced in 
epididymal adipose tissues lacking Tribl in haematopoietic cells. In 
contrast, Tnf and Inos mRNA expression were elevated in adipose 
tissues from Tribl~/~ bone marrow chimaeric mice (Fig. 30). Because 
several reports indicated that IL-10 has a central role in repressing 
lipolysis in adipocytes in addition to having anti-inflammatory 
effects’, it is possible that M2-like macrophages are important 
for maintaining adipose tissues, at least in part, through the produc- 
tion of IL-10. However, the contribution of additional molecules pro- 
duced by the macrophages may also be critical for amelioration of 
lipodystrophy. 

Lipodystrophy is often associated with metabolic abnormalities. 
Although significant differences in levels of the parameters such as 
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serum glucose, cholesterol, triglyceride and insulin were not detected 
between wild-type and Tribl ‘~ bone marrow chimaeric mice fed a 
normal diet, the feeding of mice with a high-fat diet (HFD) led to much 
higher elevations of these parameters in the serum of Tribl ‘~ bone 
marrow chimaeric mice (Fig. 4a). Mice lacking Trib1 throughout the 
whole body and in haematopoietic cells also developed glucose into- 
lerance and insulin resistance on a HFD (Fig. 4b, c and Supplementary 
Fig. 20). In contrast, body weight did not differ significantly between 
wild-type and Trib1~'‘~ bone marrow chimaeric mice fed a HED 
(Supplementary Fig. 21). Whereas numbers of M1-type macrophages 
increased in both wild-type and Tribl~‘~ bone marrow chimaeric 
mice as well as mice lacking Tribl in the whole body fed a HFD, 
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Figure 4 | Mice lacking Trib1 in haematopoietic cells develop exacerbated 
metabolic disorders on a high-fat diet. a, Serum samples were obtained from 
bone-marrow-transferred wild-type and Trib1 /~ littermates fed normal chow 
(NC) or a high-fat diet (HFD). The concentrations of cholesterol, insulin, 
glucose and triglyceride were measured. Error bars indicate s.d. of duplicates. 
b, c, Fasting male bone-marrow-transferred wild-type and Trib1 ~~ littermates 
fed a HFD were challenged with intraperitoneal glucose (b) or insulin (c), and 
blood was sampled for glucose analyses at the indicated times. Data are mean 
s.d. of 5 samples. d, Flow cytometric analyses of haematopoietic cells from 
adipose tissues in wild-type and Tribl ‘~ mice fed a HED. The expression 
levels of MR* and Macl” tissue-resident macrophages are shown (left). The 
MR‘ and CD11c* macrophages in adipose tissue are also shown (right). 

e, f, Haematoxylin-and-eosin-stained images of paraffin-embedded epididymal 
adipose tissue sections are shown (e), and each area of adipocytes was measured 
(f). Scale bars, 50 jum. g, Total RNA was prepared from wild-type and Tribl‘~ 
chimaeric mice fed a HFD. Error bars indicate s.d. of duplicates. The expression 
levels of mRNAs encoding the indicated genes were determined by quantitative 
PCR. Error bars indicate s.d. of duplicates. Statistical significance in a-c, f, g was 
determined using the Student’s t-test. *P < 0.05; **P < 0.01. 
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M2-like macrophages were severely decreased in the absence of Tribl 
(Fig. 4d and Supplementary Fig. 22). Tribl~'~ chimaeric mice had 
smaller adipocytes on a HFD (Fig. 4e, f). Notably, Tnfand Inos mRNA 
were markedly augmented in Tribl~'~ chimaeric mice (Fig. 4g). 
Taken together, these findings suggest that Tribl”'~ bone marrow 
chimaeric mice develop metabolic disorders on a HFD through 
impaired lipid buffering caused by lipodystrophy. 

Although we previously showed that Tribl regulates C/EBPB 
expression in thioglycollate-elicited macrophages", we found that 
M?2-like macrophages were present in Cebpb ‘~ mice (data not 
shown). Thus, Tribl seems to control different target proteins for 
degradation depending on the cell types and their differentiation 
stages. We previously showed that Jmjd3 is essential for M2-like 
macrophage differentiation in response to chitin administration and 
helminth infection’. However, Jmjd3 deficiency did not show any 
defect in tissue-resident M2-like macrophage generation (Supplemen- 
tary Fig. 23). Reciprocally, Trib1 deficiency did not alter the activation 
of M2-like macrophages to chitin administration (Supplementary 
Fig. 24). It can be assumed that macrophage subtypes are more com- 
plex in vivo, although the precise classification of M2-type macro- 
phages in vivo is yet to be clarified. 

Chronic low-grade inflammation in adipose tissues is thought to 
have a central role in the exacerbation of metabolic disorders**”. M1- 
type macrophages infiltrate into obese adipose tissues via MCP1, 
where they are activated in part by saturated fatty acids and cause 
low-grade inflammation by producing proinflammatory cytokines**”’. 
However, tissue-resident M2-like macrophages, which differentiate 
depending on the presence of Tribl in the bone marrow, are sub- 
sequently activated in adipose tissues and the liver in response to 
IL-4 and IL-13 produced from eosinophils by a PPAR-y- and 
PPAR-6-dependent mechanism, respectively*'***°. Further studies 
are required to clarify how tissue-resident M2-like macrophages main- 
tain adipose tissue homeostasis. Trib1 was also essential for the deve- 
lopment of eosinophils, nevertheless, we believe that Trib] maintains 
adipose tissues by controlling M2-like macrophages, as supplementa- 
tion of wild-type macrophages alone could rescue the lipodystrophy 
observed in Tribl~'~ mice. 

Mutations in TRIB1 have been implicated in metabolic disorders of 
humans. It is tempting to speculate that TRIB1 functions in a similar 
manner in humans and that other pathophysiologies, such as tumour 
progression, angiogenesis or tissue remodelling, are affected by these 
tissue-resident M2-like macrophages. 


METHODS SUMMARY 


Mice, reagents, cells and plasmids. Full details are provided in the Methods. 
Colony-forming assay. Bone marrow cells (4 X 10“) were seeded on 3.5-cm dishes 
and cultured for 7 days with Methocult (no. 3534; Stem Cell Technologies) con- 
taining SCF, IL-6 and IL-3 in the presence of 20ng ml’ M-CSF. The numbers of 
GFP-positive cells were counted for colony-forming assays or collected for real- 
time (q)PCR and western blotting. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Mice, cells and reagents. Trib1~'~ mice were generated as described previously". 
Antibodies for flow cytometry were purchased from commercial sources as fol- 
lows: anti-F4/80-FITC (BM8; BioLegend); anti-Macl-PerCP Cy5.5 (M1/70; BD 
Biosciences); anti-Ly6C-FITC (AL-21; BD Biosciences); anti-Siglec-F-PE (E50- 
2446; BD Biosciences); anti-CCR3-APC (83103; BD Biosciences); anti-MR- 
Alexa 488 (MR5D3; BioLegend); and anti-CD11c-Brilliant bright (N418; 
BioLegend). Antibodies and reagents for histochemical analysis were purchased 
from commercial sources as follows: biotinylated anti-F4/80 (eBioscience), 
anti-MOMA-1 (Abcam), Alexa 488-conjugated anti-B220 (BD Pharmingen) 
and streptavidin-Alexa594 (Invitrogen). Antibodies for western blotting were 
purchased as follows: anti-C/EBPa (no. 2295; Cell Signaling Technology); 
anti-C/EBPB (no. E298; Millipore); anti-C/EBP6 (no. 2318; CST); anti-C/EBPe 
(no. sc-158; Santa Cruz Biotechnology); and anti-actin (no. sc-1615; Santa Cruz 
Biotechnology). The probes for quantitative PCR were purchased from Lifescience 
Japan. 

Generation of Trib2~/~ and Trib3~'~ mice. The Trib2 and Trib3 genes were 
isolated from genomic DNA extracted from embryonic stem (ES) cells (GSI-I) by 
PCR. The targeting vectors were constructed by replacing a 2-kilobase (kb) frag- 
ment encoding the Trib2 open reading frame (exon 2) and replacing a 2.5-kb 
fragment encoding the Trib3 open reading frame (exons 2-3) with a neomycin- 
resistance gene cassette (neo), and inserting herpes simplex virus thymidine kinase 
(HSV-TK) driven by the PGK promoter into the genomic fragment for negative 
selection. After each targeting vector was transfected into ES cells, G418 and 
ganciclovir double-resistant colonies were selected and screened by PCR, and 
recombination was further confirmed by Southern blotting. The homologous 
recombinant clones were individually microinjected into blastocysts derived from 
C57BL/6 mice and then transferred to pseudopregnant females. Mating of chi- 
maeric male mice with C57BL/6 female mice resulted in the transmission of 
each mutant allele to the germ line. The resulting Trib2*’” and Trib3‘/~ mice 
were interbred to generate Trib2-'~ and Trib3 ‘~ mice, respectively. All animal 
experiments were performed with approval from the Animal Research Committee 
of the Research Institute for Microbial Diseases (Osaka University). 
Construction of expression plasmids and shRNA vectors. Trib1 and Cebpa 
cDNAs were obtained by PCR from a mouse cDNA library. Full-length or mutated 
Tribl cDNAs were cloned into the pLZR-cmv-ires-GFP vector for retrovirus 
production. For shRNA experiments, the following oligonucleotides were used 
to target mouse C/EBPa: cebpa-8 dn, 5’-gatccGTCGTATGTATTATATCTATA 
TTCAAGAGATATAGATATAATACATACGACTTTTTTACGCGTg-3' and 
cebpa-8 up, 5’-aattcACGCGTAAAAAAGTCGTATGTATTATATCTATATC 
TCTTGAATATAGATATAATACATACGACG-3’; cebpa-12 dn, 5'-gatccATCC 
GATATCAACACTTGTATTTCAAGAGAATACAAGTGTTGATATCGGATT 
TTTTTACGCGT¢g-3’ and cebpa-12 up, 5’-aattcACGCGTAAAAAAATCCGA 
TATCAACACTTGTATTCTCTTGAAATACAAGTGTTGATATCGGAT¢-3’. 
The oligonucleotides were annealed and ligated with the RNAi-ready pSIREN- 
RetroQ-zsGreen vector (Takara). Lowercase letters indicate the site of restric- 
tion enzyme. 

Colony-forming assay. Bone marrow cells (4X 10*) were seeded on 3.5-cm 
dishes and cultured for 7days with Methocult (no. 3534; Stem Cell Tech- 
nologies) containing SCF, IL-6 and IL-3 in the presence of 20ngml_' M-CSF. 
The numbers of GFP-positive cells were counted for colony-forming assays or 
collected for real-time (q)PCR and western blotting. 

Assessment of glucose, insulin and NEFA levels and glucose and insulin sens- 
itivity in vivo. Serum insulin levels were determined by ELISA (Morinaga 
Institute of Biological Science Inc.). Serum glucose, NEFA and cholesterol levels 
were measured with enzymatic kits (WAKO Chemicals). Intraperitoneal glucose 
tolerance tests were performed on 16h fasted mice injected intraperitoneally 
with D-glucose (1.5mgg ' body weight). Serum glucose levels were measured 
immediately before and 30, 60, 90 and 120 min after the D-glucose injection. 
Intraperitoneal insulin tolerance tests were performed on 4h fasted mice injected 
intraperitoneally with insulin (1 mU g ' body weight). Serum glucose levels were 
measured immediately before and 15, 30, 60 and 90 min after the insulin injection. 
Lipolysis assay. In vivo lipolysis assays were performed on 4h fasted mice. Blood 
was collected from orbital veins, and the serum glycerol and NEFA concentrations 
were measured. 

Quantitative PCR analysis. Total RNA was isolated using Trizol (Invitrogen) and 
subjected to reverse transcription with ReverTra Ace (Toyobo) according to the 
manufacturer’s instructions. For quantitative PCR, cDNA fragments were amp- 
lified using real-time PCR Master Mix (Toyobo). Fluorescence from the TaqMan 
probe for each cytokine was detected using a 7500 Real-time PCR System (Applied 
Biosystems). To determine the relative induction of each cytokine mRNA in res- 
ponse to various stimuli, the mRNA expression level of each gene was normalized to 
the 18S rRNA expression level. The experiments were repeated at least twice. 
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Assay ID numbers are: Arg], Mm00475988_m1, Ym1, Mm00657889_mH, 
Fizz1, Mm00445109_m1, MR, Mm00485148_ m1; Tribl, Mm00454875_m1; 
Trib2, Mm0045 4876_ml; Trib3, Mm00454879_m1; Acaca, Mm01304277_m1; 
Fasn, Mm00662319_m1; Acsl1, Mm00484217_m1; PParg, Mm01184322_m1; 
Cebpa, Mm00514283_s1; Cebpb, Mm00843434_s1; Cebpd, Mm00786711_s1; 
1110, Mm00439614_m1; Tnfa, Mm99999068_m1; Inos, Mm00440485_m1. 
Immunoblot analysis. Bone marrow cells were collected and red blood cells were 
lysed using red blood cell lysis buffer (Invitrogen). After two washes with MACS 
buffer, lineage-negative cells were collected by MACS sorting using anti-CD3, 
anti-CD19, anti-B220, anti-Terl19, anti-Grl, anti-DX5, anti-Macl and anti- 
CD11c MACS beads (Miltenyi Biotec). The lineage-negative cells were lysed with 
lysis buffer (20 mM Tris-HCl, pH 7.5, 150mM NaCl, 1mM EDTA, 1% NP40) 
containing Complete Protease Inhibitor Cocktail (Roche). The cell lysates were 
separated by standard SDS-PAGE and analysed by immunoblotting. Antibodies 
against the following proteins were used: C/EBPa, C/EBPB, C/EBP6, C/EBPé and 
actin. The Luminata Forte Western HRP Substrate (Millipore) was used for the 
development of positive signals. 

Flow cytometry. Cell suspensions were prepared by sieving and gentle pipetting. 
After washing with ice-cold FACS buffer (0.5% BSA and 2mM EDTA in PBS, 
pH 7.2), the cells were incubated with antibodies for 15 min and washed twice with 
FACS buffer. Data were acquired using a FACSCalibur flow cytometer (BD), and 
analysed by FlowJo software (Tree Star). 

Retroviral preparation. Retroviruses were produced using PlatE packaging cells 
or transfected with various plasmids. Bone marrow cells were transduced with the 
retroviral supernatants (supplemented with polybrene (Millipore)) twice. After 
the second transduction, the cells were washed twice and re-suspended in RPMI 
medium. 

Magnetic resonance imaging (MRI). An 11.7 T MRI scanner (AVANCE-II 500 
WB; Bruker BioSpin) was used to acquire in vivo mouse images. Mice were 
anaesthetized with 1.0-1.5% isoflurane during the MRI procedures. We used a 
T>-weighted imaging sequence (RARE) with the following parameters: field of 
view, 2.5 X 2.5 cm; matrix size, 256 X 256; slice thickness, 0.5 mm; repetition time, 
5,000 ms; echo time, 25.2 ms; average, 4; scan time, 5 min. 

Histochemical analysis. Frozen 6-|1m tissue sections were fixed in 4% para- 
formaldehyde and blocked with 5% BSA (Invitrogen). For Perl’s Prussian blue 
staining, tissues were fixed with 4% paraformaldehyde in phosphate buffer 
(pH 7.0), embedded in paraffin and stained with Perl’s Prussian blue. 
Generation of bone-marrow-transferred chimaeric mice. Bone marrow cells 
were prepared from wild-type and Trib! ~‘~ mice, and intravenously injected into 
lethally irradiated CD45.1 C57BL/6 mice, wild-type and Tribl”'~ mice. The chi- 
maeric mice were administered neomycin (Sigma) and polymyxin B (Sigma) in 
their drinking water for 8 weeks. The mice were analysed for at least 12 weeks after 
reconstitution. More than 90% of splenocytes from the chimaeric mice were 
CD45.2*. 

Mouse SVF and MAF isolation. Epididymal fat pads from male mice fed a 
normal or HFD were excised and minced in Krebs-Ringer-bicarbonate-HEPES 
buffer (KRBH: 120 mM NaCl, 4 mM KH3PO,, 1 mM MgSO,, 1 mM CaCl, 10 mM 
NaHCO;, 30 mM HEPES, 20 uM adenosine (Sigma), 4% BSA). The tissues were 
then digested with 1 mg ml‘ collagenase type II (Sigma) in PBS at 37°C for 
30 min with gentle shaking and filtered through a 100-mm filter. In SVF pre- 
parations, the digested cells were separated by centrifugation at 700g for 5 min. 
In MAF preparations, the digested cells were separated by centrifugation at 300g 
for 5 min. The resulting cells were washed twice with KRBH buffer before use 
in experiments. 

Metabolic measurements. Serum insulin levels were determined by ELISA 
(Morinaga Institute of Biological Science Inc.). Serum glucose, NEFA and choles- 
terol levels were measured with enzymatic kits (WAKO Chemicals). Intraperitoneal 
glucose tolerance tests were performed on 16h fasted mice injected intraperitone- 
ally with p-glucose (1.5mgg_ | body weight). Serum glucose levels were measured 
immediately before administration and 30, 60, 90 and 120 min after the D-glucose 
injection. 

Lipolysis assay. In vivo lipolysis assays were performed on 4h fasted mice. Blood 
was collected from orbital veins, and the serum glycerol and NEFA concentrations 
were measured. For ex vivo assays, adipocytes were isolated from epididymal pads 
using collagenase, washed twice, re-suspended in Krebs-Ringer buffer at 5 X 10° 
cells ml~' and incubated with shaking for the specified times. The glycerol and 
NEFA concentrations were measured in the supernatants. 

Microarray analysis. Total RNA was isolated from wild-type and Tribl_/~ GMP 
and MDP using TRIzol RNA isolation kit (Invitrogen) and further purified using 
an RNeasy kit (Qiagen). Biotinylated cDNA was synthesized from 100 ng total 
RNA with the Ovation biotin RNA amplification and labelling systems (Nugen) 
according to the manufacturer’s protocol. The product was purified using an 
DyeEx 2.0 spin kit (QIAGEN), fragmented, and hybridized to Affymetrix mouse 
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expression array A430 2.0 microarray chips, according to the manufacturer’s 
protocol (Affymetrix). Staining, washing and scanning of Affymetrix mouse 
Genome 430 2.0 microarray chips was done following the manufacturer’s instruc- 
tions. Robust multichip average (RMA) expression values were calculated using R 
and the Bioconductor affy package. For each probe the changes in expression 
between wild-type and Tribl~‘~ samples were defined as the difference between 


log, values for wild-type and Trib1~'~ cells. Genes were assigned the values of 
their corresponding probe(s). In cases where a gene was associated with multiple 
probes the average value was taken. 

Statistical analysis. The statistical significance of differences between values was 
calculated using the two-tailed Student’s t-test. Values of *P < 0.05, **P<0.01 
were considered to indicate statistical significance. 
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CLASP-mediated cortical microtubule organization 
guides PIN polarization axis 


Klementina Kakar', Hongtao Zhang’, Ben Scheres'*"* & Pankaj Dhonukshe!"* 


Recent evidence indicates a correlation between orientation of the 
plant cortical microtubule cytoskeleton and localization of polar 
cargoes’. However, the molecules and mechanisms that create 
this correlation have remained unknown. Here we show that, in 
Arabidopsis thaliana, the microtubule orientation regulators 
CLASP”? and MAP65 (refs 3, 4) control the abundance of pola- 
rity regulator PINOID kinase*” at the plasma membrane. By loca- 
lized upregulation of clathrin-dependent endocytosis’ at cortical 
microtubule- and clathrin-rich domains orthogonal to the axis of 
polarity, PINOID accelerates the removal of auxin transporter PIN 
proteins*° from those sites. This mechanism links directional 
microtubule organization to the polar localization of auxin trans- 
porter PIN proteins, and clarifies how microtubule-enriched cell 
sides are kept distinct from polar delivery domains. Our results 
identify the molecular machinery that connects microtubule orga- 
nization to the regulation of the axis of PIN polarization. 

In animals, the cytoskeleton establishes and sustains polar domains'!. 
In some plant cells, actin and microtubule networks form mutually 
exclusive structures regulated by plant Rho homologues"’, mediating 
polarization of PIN family auxin efflux carriers'*"*. Notably, cortical 
microtubule orientation correlates with PIN polarization in the shoot 
apex', suggesting another link between cytoskeletal configurations 
and PIN localization. Auxin is a key plant signalling molecule with 
numerous roles in development*"’. Polar localization of PIN proteins 
regulates cell-to-cell auxin transport and auxin distribution-dependent 
plant patterning*°. Therefore, mechanisms regulating PIN locali- 
zation are crucial for plant morphogenesis. Regulation of PIN phos- 
phorylation through PINOID (PID) kinases restricts PIN trafficking to 
one of two opposing polar domains”®, but how the axis of polarization 
is selected remains unsolved. Microtubule regulators CLASP and 
MAP65 are required for precise switches in cortical microtubule orien- 
tation in plant cells’. Reasoning that, like in animals, plant microtu- 
bules might control the cell polarization axis and that microtubule 
organization and PIN polarization correlate’, we investigated whether 
CLASP regulates the axis of PIN polarity. 

PIN2::PIN2-GFP in wild-type cells reveals apical (‘shootward’) 
localization in the epidermis, and basal (‘rootward’) localization in 
the cortex® (Fig. 1a). In clasp-1 mutant cells'®, PIN2 localization shifted 
from rootwards to shootwards in the cortex, with reduced PIN2 levels 
compared to control (Fig. la-c and Supplementary Fig. 1). Polar loca- 
lization of PIN2 became sharper in clasp-1 mutants, without any 
alterations in PIN2 transcript levels (Fig. 1d-f; see Methods and 
Supplementary Fig. 2 for details on intensity quantifications). Induc- 
tion of CLASP in clasp-1 mutants largely rescued cortical PIN2 loca- 
lization defects (Supplementary Fig. 3a-g). Rootward-to-shootward 
shifts in PIN2 localization occur in elongating cortical cells of the wild 
type, but not in the root meristematic cortical cells of many short 
root meristem mutants, indicating that the PIN2 switch in the clasp- 
1 cortex was not related to its reduced root meristem (Supplementary 
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Figure 1 | PIN2 localization and trafficking is altered in the clasp-1 mutant. 
a-c, PIN2::PIN2-GFP localization in wild type (WT) and clasp-1 mutants 
(n> 16 roots) (a, b), and PIN2-green fluorescent protein (GFP) intensity at the 
plasma membrane in the epidermis and cortex of wild type (n > 172 cells from 
18 roots) and clasp-1 mutants (n > 188 cells from 20 roots) (c). FI, fluorescence 
intensity. d, Polar to lateral PIN2-GFP intensity ratio in epidermis and cortex 
of wild type (n > 49 cells from 8 roots) and clasp-1 mutants (n > 56 cells from 
10 roots). e, f, Expression of ACTIN 2 (ACT2) and PIN2 in wild type and clasp-1 
mutants as detected by PCR with reverse transcription (RT-PCR) (e), and 
quantitative RT-PCR (qRT-PCR) (f). g-j, PIN2—GFP aggregation into BFA 
compartments in the epidermis and cortex of wild type and clasp-1 mutants 
after 30 min CHX pre-treatment followed by 30 min BFA plus CHX treatment 
(g, h); and PIN2 recycling to the plasma membrane in epidermis and cortex of 
wild type and clasp-1 after 1h BFA washout in the presence of CHX 

(i, j). h, PIN2-GFP endocytosis in wild type (n > 65 cells from 8 roots) and 
clasp-1 mutants (n > 52 cells from 8 roots). j, PIN2—GFP recycling in wild type 
(n> 82 cells from 9 roots) and clasp-1 mutants (n > 40 cells from 8 roots). c, 
cortex; e, epidermis. Green denotes GFP. Arrowheads depict PIN2 polarity. 
Data are mean and s.d. *P < 0.001 (Student’s t-test). Original magnification for 
all confocal images, X63. 


Department of Biology, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands. Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht 
Institute for Pharmaceutical Sciences, Utrecht University, Padualaan 8, 3584CH Utrecht, The Netherlands. 3Netherlands Proteomics Centre, Padualaan 8, 3584CH Utrecht, The Netherlands. 4Wageningen 
University Research, Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands. *Department of Plant Systems Biology, VIB, Technologiepark 927, B-9052 Ghent, Belgium. °Department of Plant 


Biotechnology and Bioinformatics, Ghent University, B-99052 Ghent, Belgium. 


28 MARCH 2013 | VOL 495 | NATURE | 529 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


Fig. 31). Rootward PIN1 localization in vasculature and its trans- 
cript levels remained unaffected in clasp-1 mutants (Supplementary 
Fig. 4a-h). Moreover, ectopically expressed PIN2 remained rootward 
in vasculature of the wild type and clasp-1 mutant (Supplementary 
Fig. 6a, c). Our data indicate that CLASP regulates PIN2 localization 
in the root cortex. 

Treatment with recycling inhibitor brefeldin A (BFA) and trans- 
lation inhibitor cyclohexamide (CHX)"* resulted in increased PIN2 
internalization into BFA compartments in epidermal and cortical cells 
of clasp-1 as compared to cells of wild type (Fig. 1g, h and Supplemen- 
tary Fig. 3m, n). BFA washout in the presence of CHX restored PIN2 
localization at the plasma membrane in wild type and clasp-1 cells, 
suggesting intact PIN2 recycling in clasp-1 mutants (Fig. li, j and Sup- 
plementary Fig. 30, p). Endocytosis and recycling of PIN1::PIN1-GFP 
remained unaffected in clasp-1 mutants (Supplementary Fig. 4i-n). Our 
results reveal enhanced PIN2 endocytosis, and operational PIN2 recyc- 
ling, in the clasp-1 cortical and epidermal layers. 

Because PIN2-mediated auxin distribution is implicated in the gravi- 
tational response’ and root meristem patterning’, and PIN2 locali- 
zation and trafficking are altered in clasp-1 mutants, we analysed 
auxin-related root phenotypes. clasp-1 mutant roots showed reduced 
growth, reduced meristem size, cellular organization defects, abnormal 
gravitropic growth, and delayed responses to stimulation by gravity 
(gravistimulation; Supplementary Fig. 5a—h). Induction of CLASP in 
clasp-1 mutants largely rescued these defects (Supplementary Fig. 3a-c, 
h-k). Although gravistimulated wild-type roots displayed a strong 
asymmetric distribution of the auxin activity reporter DR5::YFP-NLS’, 
this response was delayed and impaired in clasp-1 roots (Supplementary 
Fig. 5i, j). Furthermore, application of the auxin analogues indole-3- 
acetic acid (IAA), 1-naphthaleneacetic acid (NAA), auxin efflux inhi- 
bitor 1-N-naphthylphthalamic acid (NPA), and endogenous auxin 
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Figure 2 | PINOID abundance is enhanced in the clasp-1 mutant. 

a-c, PID::PID-YFP localization (a, b) and fluorescence intensity (c) at the 
plasma membrane in epidermis and cortex of wild type (n > 28 cells from 8 
roots) and clasp-1 mutants (n > 26 cells from 7 roots). d, Polar to lateral PID- 
yellow fluorescent protein (YFP) intensity ratio in epidermis and cortex of wild 
type (n > 38 cells from 10 roots) and clasp-1 mutants (n > 30 cells from 8 
roots). e, Total protein fractions of wild type, clasp-1, PID::PID-YFP and clasp-1 
+ PID::PID-YFP seedlings probed with anti-YFP and anti-actin antibodies. 

f, g, Expression of ACTIN 2 and PID in wild type and clasp-1 mutants as 
detected by RT-PCR (f) and qRT-PCR (g). Green denotes YFP. Data are mean 
and s.d. *P < 0.001 (Student's t-test). Original magnification for all confocal 
images, X63. 
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overproduction*® enhanced DR5::YFP-NLS expression (Supplemen- 
tary Fig. 5k—-m) and partially rescued cellular organization defects of 
clasp-1 roots (Supplementary Fig. 5r, s), but not meristem size (Sup- 
plementary Fig. 5n-q). This suggests functional auxin perception and 
responses in clasp-1 mutants, and the causal relation of clasp-1 root 
meristem defects with shootward PIN2-mediated changes in auxin 
transport. In accordance, expression of rootward PIN2::PIN1-GFP2 
(ref. 9) in clasp-1 mutants partially resumed rootward auxin flux and 
slightly rescued clasp-1 root defects (Supplementary Fig. 5t—w). 

The decision for shootward or rootward PIN localization depends on 
PID action’. In wild-type roots, high epidermal plasma membrane- 
bound PID levels efficiently phosphorylate PIN2 for its shootward 
translocation, whereas low cortical plasma membrane-bound PID 
levels inefficiently phosphorylate PIN2 for its rootward localization® 
(Fig. 2a). PID::PID-YFP displayed more intense plasma membrane 
localization signals and higher protein levels in clasp-1 mutants com- 
pared to wild type (Fig. 2b, c, e), without changes in transcript levels 
(Fig. 2f, g). CLASP induction in clasp-1 mutants restored PID abun- 
dance (Supplementary Fig. 60, p), and PID induction switched PIN2 
from rootward to shootward cell sides in the vasculature and cortex 
(Supplementary Fig. 6a—d), implicating increased PID levels in clasp-1 
mutants for PIN2 relocation. Accordingly, cortically induced PID- 
YFP (CO2::XVE >> PINOID- YFP’) triggered rootward-to-shootward 
PIN2 shifts (Supplementary Fig. 6e, f), and mimicked organ level phe- 
notypes observed in the clasp-1 mutant (Supplementary Fig. 6g-k), 
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Figure 3 | Endocytosis is enhanced in clasp-1 mutants by a PID-dependent 
increment in plasma-membrane localized clathrin. a—e, Evaluation of 
endocytosis in wild type (a), clasp-1 (b), pid wag1 wag2 (c) and G1090::XVE- 
PID (d) roots by analysing FM4-64 internalization (e) (n > 45 cells from 10 
roots each). f-i, Clathrin localization in wild type (f), clasp-1 (g), pid wag1 wag2 
(h) and G1090::XVE-PID (i) roots. j, k, Clathrin intensity at the plasma 
membrane (n > 24 cells from 7 roots each) (j) and its polar to lateral intensity 
ratio (n > 44 cells from 9 roots each) (k). 1, Schematic illustrating the 
correlation between the microtubule orientation-related levels of lateral PID 
and lateral clathrin to the amount of lateral PIN removal in epidermal, cortical 
and vascular cells of wild type and clasp-1 mutants. Line and arrow thicknesses, 
as well as ‘+’ and ‘—’, depict relative amounts of PID, clathrin and PIN. Red 
denotes FM4-64, green denotes anti-CHC (clathrin heavy chain). Arrowheads 
highlight CHC localization at the plasma membrane. Data are mean and s.d. 
*P < 0.001 (Student’s t-test). Original magnification for all confocal images, X63. 
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Figure 4 | Microtubule orientation defines the axis of PIN polarization. 

a, b, PIN2 localization in wild type (rootward in cortex; shootward in 
epidermis) (a) and after PLT2-GR induction (shootward-lateral in epidermis) 
(b). c, PLT2 induced shootward-lateral epidermal PIN2 localization persists 
after periclinal cell divisions. dj PIN2 polarity after PLT2-GR induction in 
clasp-1 mutants (d, e), 35S::MAP65-1,2 amiRNA (g, h) and after PEO-IAA 
treatment (largely shootward in cortex and epidermis) (i, j). Normal PIN2 
localization in wild type after DEX treatment (f). k-p, After PLT2-GR 
induction, clathrin localizes at polar cell sides in the epidermis of wild type 
(k) and after inactive PEO-IAA treatment (0), and at lateral cell sides in 
epidermis of clasp-1 mutants (1), 35S::MAP65-1,2 amiRNA mutants (m), after 
active PEO-IAA treatment (n), and in tirl-1 afb2-1 afb3-1 mutants (p). PLT2- 
GR induction followed by inactive PEO-IAA treatment results in clathrin 
localization at the polar cell sides of epidermis (0). q, s, Polar to lateral PIN2 


whereas PID repression largely rescued PIN2 defects in clasp-1 mutants 
(Supplementary Fig. 6l-n). Together, our results indicate that loss of 
CLASP increases PID levels at the plasma membrane to influence 
PIN2 localization. 

Because increased PIN2 endocytosis correlated with enhanced PID 
levels in clasp-1 mutants, we tested whether PID affected endocytosis. 
We probed endocytosis by internalization of the endocytic tracer 
FM4-64 (ref. 17). Fewer FM4-64-labelled endocytic vesicles were 
present in pid wag1 wag2 (ref. 6) triple mutants, and more were present 


intensity ratio (n > 36 cells from 8 roots each (q); n > 27 cells from 8 roots each 
(s)). r, t, Polar to lateral clathrin intensity ratio (n > 27 cells from 8 roots each 
(r);n > 19 cells from 8 roots each (t)). u, v, Schematic illustrating microtubule 
array organization and PIN2 polarity in epidermal cells of wild type (u) and 
clasp-1 mutants (v), following PLT2-GR induction. Note, in wild type, clasp-1 
mutants and clasp-1 mutants plus PLT2-GR induction, PIN2 localizes to the 
polar cell side, aligning with transversal microtubule array. After PLT2-GR 
induction in the wild type, the microtubules reorient from transversal to 
longitudinal, and accordingly PIN2 shifts its localization from the polar to 
lateral cell side. Line and arrow thicknesses depict the amounts. In a-p, red 
denotes PIN2, green denotes CHC; arrowheads depict PIN2 or CHC 
localizations. 35S::MAP65-1,2 amiRNA targets MAP65-1 and MAP65-2 (ref. 3). 
Data are mean and s.d. *P < 0.001 (Student’s t-test). Original magnification for 
all confocal images, X63. 


in clasp-1 mutants and in induced G1090::XVE >> PID, than in wild 
type (Fig. 3a-e), indicating the influence of PID on endocytosis. As 
clathrin is a major effector of endocytosis in plants’ we tested clathrin 
localization. Clathrin levels increased at the clasp-1 mutant plasma 
membrane and after PID induction, particularly at the lateral domains, 
whereas they decreased at the pid wagl wag2 mutant plasma mem- 
brane, as compared to wild type (Fig. 3f-k), indicating a positive cor- 
relation of PID to clathrin levels to upregulate PIN endocytosis (Fig. 31). 
In accordance, PID-induction-triggered PIN relocation occurs rapidly 
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(by upregulation of PIN endocytosis)°, and the stress-induced shift in 
PIN relocation (requiring PIN endocytosis) is impaired in a pid mutant’. 

Owing to impaired microtubule crossover from lateral to top- 
bottom polar cell sides*’, lateral microtubules were more predominant 
in the clasp-1 cortex and epidermis (Supplementary Fig. 7a—-e). This 
skewed microtubule distribution enhanced PID levels at the lateral cell 
sides of epidermal and cortical cell layers in clasp-1 mutants (Fig. 2d 
and Supplementary Fig. 7h), and CLASP induction in clasp-1 mutants 
restored this defect (Supplementary Fig. 6p), suggesting a causal link 
between axial microtubule density and PID enrichment. Oryzalin treat- 
ment, which depolymerizes microtubules and detaches CLASP from 
microtubules and cell periphery’ , reduced lateral PID levels, decreased 
clathrin abundance and internalization of FM4-64 (ref. 17; Supplemen- 
tary Fig. 7f-n), and increased PIN2 lateral abundance (Supplementary 
Fig. 7o-q). Microtubular manipulation suggests that CLASP-dependent 
microtubule organization regulates lateral PID, clathrin and endocy- 
tosis levels, which in turn control lateral PIN removal. 

CLASP regulates microtubule reorientation in conjunction with 
PLETHORA (PLT) transcription factors, which induce MAP65 micro- 
tubule regulators through TIR1 auxin signalling’. MAP65-1 and 
MAP65-2 activity are required for CLASP function’. Notably, roots 
of MAP65-1,2 amiRNA lines (which target MAP65-1 and MAP65-2)’, 
map65-1 map65-2 double mutants’ and firl-1 afb2-1 afb3-2 auxin 
signalling triple mutants'® displayed root gravitropism, meristem size 
and PIN2 localization defects, all reminiscent of clasp-1 phenotypes 
(Supplementary Fig. 8a-k). Although PID messenger RNA levels 
remained unaffected, the auxin signalling antagonist PEO-IAA”’ and 
triple-mutant tirl-1 afb2-1 afb3-1 increased PID levels at the lateral 
plasma membrane (Supplementary Fig. 8l-r), indicating the require- 
ment of TIR1 and MAP65 for CLASP-mediated PID localization and, 
consequently, for defining the axis of PIN2 localization. 

To test directly whether TIR1 and MAP65 activation influences the 
polarization axis, we used the dexamethasone (DEX)-inducible PLT2- 
GR system’, which triggers cortical microtubule reorientation’. At 
~18h after PLT induction, PIN2 localization shifted from shootward 
towards lateral cell sides in root epidermal cells of wild type but not of 
clasp-1 mutants, correlating with microtubule crossover at top-bottom 
cell sides (Fig. 4a, b, d, fand Supplementary Fig. 7a—e), without altering 
PIN2 transcript levels (Supplementary Fig. 10k). This suggests that the 
axis of PIN2 localization in root epidermal cells is modified after micro- 
tubule reorientation but before cell division plane rotation (Fig. 4a-e, s). 
This PLT2 effect was suppressed in MAP65-1,2 amiRNA (Fig. 4g, h, s), 
in tirl-1 afb2-1 afb3-1 seedlings, and in seedlings treated with PEO- 
IAA (Fig. 4i, j, s). Together, these results indicate that CLASP, and TIR1 
auxin signalling-induced MAP65 action can define a microtubule- 
dependent axis of PIN polarity. 

PLT2-GR induction and NPA treatment remained able to reorient 
microtubules in mutants for multiple PINs®, and in pid wagl wag2 
mutants (Supplementary Fig. 10a-j), indicating PIN- and PID- 
independent microtubule reorientation, in line with previous work’. 

Shootward epidermal PIN2 localization increased (Fig. 4q) and clath- 
rin levels heightened at lateral cell sides compared to shootward and root- 
ward cell sides in root epidermal cells of clasp-1, 35S::MAP65-1,2 amiRNA 
and tirl-1 afb2-1 afb3-1 mutants (Fig. 4r), whereas clathrin transcript 
levels were unaltered (Supplementary Fig. 9h). These data indicated more 
PIN2 removal from lateral cell sides in those mutants than in wild type. 
PLT2-GR induction and co-treatment with inactive PEO-IAA in wild 
type shifted the predominant localization of clathrin heavy and light 
chains from the lateral to shootward-rootward cell sides of root epider- 
mal cells within the 16 h time frame that reorients microtubules (Fig. 4k, 0, 
t and Supplementary Fig. 9d, f, g). By contrast, predominant lateral loca- 
lization of clathrin in root epidermal cells remained unaltered after PLT2 
induction in clasp-1, in 35S::MAP65-1,2 amiRNA, in tirl-1 afb2-1 afb3-1 
mutant backgrounds, and in wild type after active PEO-IAA co-treatment, 
where PLT2 induction fails to reorient microtubules (Fig. 41-n, p, t and 
Supplementary Fig. 9a-c, e, g). Together, these findings indicate that 
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microtubule reorientation switches the axis of enhanced microtubule 
and clathrin localization, which in turn contributes to the switch in 
PIN2 localization by changing the axial bias of PIN2 endocytosis 
(Fig. 4u, v). 

Our findings connect CLASP- and MAP65-mediated microtubule 
array orientation to the establishment of the polarity axis in plant cells. 
The data suggest a mechanism in which CLASP controls microtubule 
organization over different surfaces. Given the enhanced microtubule 
co-alignments in cell faces perpendicular to the axis of polarity in 
clasp-1 mutant roots”, we predict a mechanism in which high trans- 
versal microtubule organization increases PID and clathrin levels, 
which locally stimulate PIN endocytosis. In accordance, enhanced 
clathrin localization and enhanced endocytosis have been shown to 
occur at the oriented microtubule dense regions during preprophase 
stage®’’. Differential endocytosis then favours PIN retention at the cell 
faces through which the axis of polarity passes. 

Our finding unravels the inner workings of the plant cell polariza- 
tion machinery by pinpointing a mechanism to restrict the cellular 
polarity axis. Furthermore, PID now seems to have two functions: 
defining the axis of polarity, and defining polarity itself within this 
polar axis**. Our data indicate that these functions (1) depend on the 
plasma membrane PID levels, which influence the axis of polarity by 
promoting endocytosis along axial cell faces, and (2) also control the 
polar recycling to one of two opposing cell faces by their absolute levels 
and phosphorylation activity. How the polarity within this axis is 
determined, how PID proteins perform both endocytosis and polar 
targeting functions, and whether PID action on clathrin-mediated 
endocytosis is locally regulated remains to be established. 


METHODS SUMMARY 

Plant material and growth conditions. All Arabidopsis lines are detailed in 
Methods. Seedlings were grown as described previously’®. All experiments repre- 
sent at least three independent biological replicates. 

PIN polarity. For PIN polarity quantification, shootward PIN2 percentages were 
quantified by counting the cells with shootward PIN2 and expressing these com- 
pared to the total number of cortical cells in the root apical meristem. 

Chemical treatments. IAA (Ducheva), NAA (Ducheva), NPA (Ducheva), IAM 
(Ducheva), BFA (Sigma), CHX (Sigma), oestradiol (Sigma), oryzalin (Sigma), 
PEO-IAA (a gift from K. Hayashi) and DEX (Sigma) were used from dimethyl- 
sulphoxide-dissolved stock solutions as indicated. 

Immunolocalization. Whole-mount immunolocalization was performed as des- 
cribed previously’®. 

Confocal image analysis. Fluorescence signal intensity was measured using the 
Leica (live) confocal software in a region of interest. Polar and lateral fluorescence 
intensity values were obtained by manually drawing lines on lateral and polar cell 
sides. Their ratios were obtained by averaging polar to lateral fluorescence inten- 
sity ratios of individual cells. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Plant material and growth conditions. Arabidopsis mutants clasp-1 (ref. 15), 
pid wagl wag2 (ref. 6), pin2 pin3 pin7 (ref. 8), pin2 pind pin7 (ref. 8), tir1-1 afb2- 
1 afb3-1 (ref. 18), scr’, wei-tar*, map65-1 map65-2 (ref. 24), 35S::MAP65-1,2 
amiRNA’*, and transgenic marker lines DR5::YFP-NLS*, WER::XVE > iaaH’, 
PIN2::PIN2-GFP**, PIN1::PIN1-GFP”’, PIN2::PIN1-GFP2 (ref. 9), PID::PID-YFP”*, 
CO2::XVE > PID-YFP®, G1090::XVE-PID®, 35S::PLT2-GR” and 35S::CLC-GFP*° 
have been described previously. 35S::PIN2-GFP was created by assembling entry 
vectors containing the 35S promoter, PIN2-GFP and NOS terminator into the 
destination vector. Oestradiol-inducible G1090::XVE >> GFP-CLASP was created 
by assembling entry vectors containing the G1090::XVE promoter, GFP-CLASP 
and NOS terminator into the destination vector. Transgenic marker lines in the 
clasp-1 background were generated by crossing. Homozygous mutant plants were 
then selected from the F, segregating population based on the genotyping, the 
typical clasp-1 phenotype, and the fluorescence of the marker. All seeds were surface- 
sterilized by gas sterilization protocol for 2h, and plated onto Murashige-Skoog 
(MS) medium, supplemented with 1% sucrose, and 9 gl? bacto-agar, pH 5.8. The 
plates were placed for the 24-h stratification period at 4 °C in the dark, then moved to 
a growth chamber (16-h light/8-h dark photoperiod at 23 °C) and kept vertical. 
All experiments were performed in at least three independent biological replicates. 
PIN polarity. For PIN polarity quantification, the cells within the root apical 
meristem zone were considered. Shootward PIN2 percentages were quantified 
by counting the cells with shootward PIN2 to the total number of cortical cells 
in the entire root apical meristem. 

Columella organization. For quantification of columella organization defects, 
cells showing abnormal shapes and more than four borders were counted as 
defective. 

Gravity stimulation. For gravity stimulation, 6-day-old seedlings were rotated by 
90° anticlockwise for 3h. For 48h gravistimulation, plates were rotated by 90° 
every 12 h (clockwise-anticlockwise). To measure the root tip bending, plates were 
photographed before and after the reorientation using a digital camera (Nikon 
D300S). The angle of root tip bending was assessed using Adobe Photoshop 
(version CS4.0). 

Chemical treatments and fluorescent dyes. [AA (Ducheva), NAA (Ducheva), 
NPA (Ducheva), IAM (Ducheva), BFA (Sigma), CHX (Sigma), oestradiol (Sigma), 
oryzalin (Sigma), PEO-IAA (a gift from K. Hayashi), and DEX (Sigma) were used 
from dimethylsulphoxide-dissolved stock solutions at 10 1M IAA, 10 uM NAA, 
10M NPA, 10M IAM, 50uM BFA, 50M CHX, 101M oestradiol, 21M 
oryzalin, 20 14M PEO-IAA and 10 uM DEX working concentrations for indicated 
periods. FM4-64 (Molecular Probes) dissolved in water was applied at a 2 [1M final 
concentration for 30min. Propidium iodide (Sigma) dissolved in water was 
applied to roots for imminent analysis. For auxin treatment, 5-day-old wild-type 
and clasp-1 seedlings were transferred onto MS media with 10 1M IAA, 10 1M 
NAA, 10 1M NPA or 10 1M IAM for 48 h. Wild-type and clasp-1 seedlings expres- 
sing PIN2::PIN2-GFP and PIN1::PIN1-GFP were treated in liquid medium with 
50 uM CHX for 30 min, followed by a 30-min treatment with 50 11M BFA and 
50 uM CHX. After BFA and CHX treatments, seedlings were transferred to MS 
liquid medium for a further 1-h washout in the presence of 50 14M CHX. For 
CO2::XVE >> PID-YFP and G1090::XVE-PID induction, seedlings were trans- 
ferred to MS media with 10 uM oestradiol for 24h. For columella organization 
and gravity analyses after PID induction, seedlings were germinated on 10 1M 
oestradiol-enriched media plates, and grown for up to 7 days. Oryzalin was applied 
to seedlings at 2 1M in MS liquid medium for 1 or 4h. For PLT2-GR induction, 
seedlings were transferred to MS media enriched with 10 1M DEX for 16-24 h. For 
NPA-triggered cell division switch analysis, seedlings were transferred to MS 
media enriched with 10 1M NPA for 60h. To observe PID::PID-YFP localization 
in response to PEO-IAA, its active and inactive forms were applied to seedlings at 
20 uM in liquid MS media for 8 h. For immunolocalization analysis, PEO-IAA was 
applied to seedlings for 24 h. For columella organization, PIN2 relocation and root 
apical meristem size observations after CLASP induction in clasp-1, seedlings were 
transferred to MS plates enriched with 10 [tM oestraidiol for the indicated periods. 
Starch granule staining. Six-day-old seedling roots were placed in Lugol’s solu- 
tion (Merck) for 2 min, then placed in chloral hydrate solution (80g chloral 
hydrate, 20 ml glycerol, 20 ml water). Differential interface contrast (DIC) micro- 
scopy analysis was performed immediately. 

EdU staining. Six-day-old seedlings of wild type, clasp-1, map65-1 map65-2, tirl 
afb2-1 afb3-1 and CO::XVE >> PID- YFP were processed using Click-iT RNA Alexa 
Fluor 488 HCS assay, as described previously”’. EdU-labelled root tips were imme- 
diately viewed by confocal laser scanning microscopy. 

Immunolocalization. Whole-mount immunolocalization was performed on wild 
type, clasp-1, pid wag1 wag2, scr, wei-tar, as well as clasp-1 mutants after BFA 
treatment and BFA washout. Furthermore, on wild type, clasp-1, pid wag] wag2, 
G1090::XVE-PID, 35S::PLT2-GR, tirl-1 afb2-1 afb3-1, 35S::MAP65-1,2 amiRNA, 


35S::PLT2-GR in clasp-1, tir1-1 afb2-1 afb3-1, 35S:;:MAP65-1,2 amiRNA, 35S::PLT2- 
GR and 35S::CLC-GFP after PEO-IAA treatment. Seven-day-old seedlings were 
fixed in 4% paraformaldehyde in MTSB buffer (50mM PIPES, 5mM MgSO,, 
5mM EGTA, pH6.9) as described previously’. Before their analyses, PLT2- 
GR was induced for 16h with 1014M DEX. Rabbit-anti-PIN2 (1:800) primary 
antibody”? or rabbit-anti-PIN1 (1:1,000) primary antibody (SCBT), in combina- 
tion with Cy3-conjugated anti-rabbit (1:500) secondary antibody was used for 
PIN2 or PIN1 detection, respectively. Rabbit-anti-clathrin (1:400) primary anti- 
body** in combination with either Cy3-conjugated anti-rabbit (1:500) secondary 
antibody (Sigma) or FITC-anti-rabbit (1:300) secondary antibody (Sigma) were 
used for clathrin heavy chain detection. Cells were then viewed by confocal laser 
scanning-microscopy. 

Microscopy and image analysis. Confocal laser scanning-microscopy was per- 
formed using Leica TCS SPII confocal microscope. Propidium iodide, GFP, YFP, 
FM4-64, Cy3 and FITC signals were excited using the Argon laser and collected at 
637-672, 498-523, 525-546, 585-663, 553-663 and 488-505 nm, respectively. All 
images were captured at X63 magnification. Finally, images were processed using 
Adobe Photoshop (version CS4.0). Fluorescence signal intensity was measured 
using the Leica (Live) confocal software in a region of interest. The polar and 
lateral fluorescence intensity values for PIN, PID, clathrin and microtubules were 
obtained by manually drawing lines on the lateral and polar cell sides (yellow and 
light blue dotted lines in Supplementary Fig. 2b). The polar and lateral fluo- 
rescence intensity ratios for PIN, PID, clathrin and microtubules were obtained 
by averaging the polar to lateral fluorescence intensity ratios of individual cells. 
The plasma membrane fluorescence intensity values for PIN2, PID, clathrin and 
FM4-64 were obtained by manually drawing lines on all cell sides (yellow dotted 
lines in Supplementary Fig. 2a, c, d). Fluorescence intensity values for the inter- 
nalized FM4-64 and PIN2 were obtained by manually drawing closed regions 
covering entire intracellular areas (light blue enclosed regions in Supplementary 
Fig. 2c, d). The percentage of internalized PIN2 (internalized PIN2/(internalized 
PIN2 + PIN2 at the plasma membrane) X 100) and internalized FM4-64 (inter- 
nalized FM4-64 /(internalized FM4-64 + FM4-64 at the plasma membrane) 
X 100)) were obtained by averaging the internalized to total fluorescence intensity 
ratios of PIN2 and FM4-64 for individual cells. The percentage of recycled PIN2 
(PIN2 at plasma membrane/(PIN2 at plasma membrane + PIN2 inside cell)) was 
obtained by averaging the recycled to total fluorescence intensity ratios of PIN2 for 
individual cells. Similar methodology was followed for quantifying PIN1 inter- 
nalization and recycling. For DIC microscopy, seedlings were placed into chloral 
hydrate solution and were observed with a Zeiss Axioplan II microscope equipped 
with DIC optics and a CCD camera. All DIC images were taken at X20 and x40 
magnifications. 

Total RNA isolation and cDNA synthesis. Total RNA was extracted from 
six-day-old wild-type and clasp-1 seedlings using the TRIZOL reagent (Sigma) 
following manufacturer’s instructions. Three micrograms of total RNA was 
digested with RNase-free DNase I (Fermentas). The absence of contaminating 
genomic DNA after DNase I treatment was confirmed by PCR analysis using 
primer pairs (FLC1-intron-F, FLC1-intron-R) designed to amplify a 200-base-pair 
(bp) intron sequence of the control gene, FLC1. For cDNA synthesis, 1 jg of total 
RNA was primed with odT18 primer (Biolegio) and reverse transcribed using 
RevertAID M-MuLV reverse transcriptase (Qiagen). The cDNA concentration 
was assessed either by amplification of a 400-600-bp segment of ACTIN 1 
cDNA (ACTI1-F, ACT1-R), or a 50-bp segment of ACTIN 2 cDNA (ACT2- 
1154-F, ACT2-1204-R). Only cDNAs with equal concentrations were considered 
suitable for RT-PCR and qRT-PCR analyses. For all primer sequences see 
Supplementary Table 1. 

RT-PCR analysis. A standard PCR protocol was applied to amplify all templates, 
followed by electrophoresis on 1.5% (v/w) agarose gel. For all primer sequences see 
Supplementary Table 1. 

qRT-PCR analysis. qRT-PCR analysis was carried out in ABI PRISM 7900 HT 
sequence detection system (Applied Biosystems) using SYBR Green to monitor 
double-stranded DNA synthesis. Reactions of 20 il total volume consisted of 2 
Fast SYBR Green reagent (Applied Biosystems), 20 ng cDNA and 200 nM of each 
gene-specific primer. The following standard PCR protocol was applied to amplify 
all templates: 50 °C for 2 min; 95 °C for 10 min; 40 cycles of 95 °C for 15 s and 60 °C 
for 1min, and SYBR Green fluorescence was measured continuously. Fluore- 
scence data for melting curves were acquired after 40 cycles by heating the sample 
up to 95°C for 15s, followed by cooling down to 60°C for 15s, and heating the 
samples to 95 °C for 15s. Data analysis was performed with the SDS 2.2.1 software 
(Applied Biosystems). To determine the threshold cycle value (C,) for each PCR 
reaction, the threshold (ARn) was set within the logarithmic amplification phase. 
PCR efficiency (E) was estimated using LinReg software with data obtained 
from the exponential phase of each individual amplification plot and the equa- 
tion: 1 + E= 10slope. Relative expression levels of genes of interest are given in 
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comparison to ACTIN 2 (ACT2-1154-F, ACT2-1204-R), and were calculated 


using the following formula: AAC, = power(1 + E; —AC)), in which AC, = Cygeney — 


Cyacrz)- For all primer sequences see Supplementary Table 1. 

Protein extraction and western blot. Five-day-old seedling was frozen in liquid 
nitrogen and ground into fine powder. The extraction buffer (50mM HEPES- 
KOH, pH7.5, 100mM KCl, 5mM EDTA, 5mM EGTA, 50mM NaF, 1% (v/v) 
IGEPAL, 0.5% (w/v) deoxycholate, 0.1% (w/v) SDS, 1 mM phenylmethylsulfonyl- 
fluoride (PMSF), 1X proteinase inhibitor cocktail (Roche), 1 phosSTOP(Roche)) 
was added to the powder to extract the protein. Protein extracts were spun twice for 
15 min at 16,000g at 4 °C and soluble protein was quantified with Bio-Rad protein 
assay reagent. Twenty micrograms of protein was separated by 10% (w/v) SDS- 
PAGE and transferred to nitrocellulose ECL membrane (Fischer Scientific). 
Western blots were probed with anti-YFP antibody (Roche) or anti-actin C4 
monoclonal antibody (Millipore). 

Accession numbers. The Arabidopsis Genome Initiative locus identifiers for the 
genes mentioned in this manuscript are as follows: CLASP (At2g20190), PINI 
(Atlg73590), PIN2 (At5g57090), PID (At2g34650), WAGI (At1g53700), WAG2 
(At3g14370), MAP65-1 (At5g55230), MAP65-2 (At4g26760), TIRI (Atlg72930), 
AEFB2 (At3g26810), AFB3 (Atlg12820), PLT2 (Atlg51190), CHC-1 (At3g08530), 
CHC-2 (At3g11130), CLC-2 (At2g40060), SCR (At3g54220), WEI-TAR (Atglg- 
70560), ACTIN 1 (At2g37620) and ACTIN 2 (At3g18780). 
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Conformational biosensors reveal GPCR signalling 


from endosomes 


Roshanak Irannejad', Jin C. Tomshine’, Jon R. Tomshine!, Michael Chevalier’, Jacob P. Mahoney’, Jan Steyaert*, 
Soren G. F. Rasmussen*®, Roger K. Sunahara’, Hana El-Samad?, Bo Huang” & Mark von Zastrow!’® 


A long-held tenet of molecular pharmacology is that canonical 
signal transduction mediated by G-protein-coupled receptor (GPCR) 
coupling to heterotrimeric G proteins is confined to the plasma 
membrane. Evidence supporting this traditional view is based on 
analytical methods that provide limited or no subcellular resolu- 
tion’. It has been subsequently proposed that signalling by inter- 
nalized GPCRs is restricted to G-protein-independent mechanisms 
such as scaffolding by arrestins”’, or GPCR activation elicits a dis- 
crete form of persistent G protein signalling*”®, or that internalized 
GPCRs can indeed contribute to the acute G-protein-mediated 
response’’. Evidence supporting these various latter hypotheses is 
indirect or subject to alternative interpretation, and it remains 
unknown if endosome-localized GPCRs are even present in an 
active form. Here we describe the application of conformation- 
specific single-domain antibodies (nanobodies) to directly probe 
activation of the B.-adrenoceptor, a prototypical GPCR", and its 
cognate G protein, G, (ref. 12), in living mammalian cells. We show 
that the adrenergic agonist isoprenaline promotes receptor and G 
protein activation in the plasma membrane as expected, but also in 
the early endosome membrane, and that internalized receptors con- 
tribute to the overall cellular cyclic AMP response within several 
minutes after agonist application. These findings provide direct 
support for the hypothesis that canonical GPCR signalling occurs 
from endosomes as well as the plasma membrane, and suggest a 
versatile strategy for probing dynamic conformational change in vivo. 

Ligand binding to the extracellular surface of the B,-adrenoceptor 
(B2-AR) stabilizes an activating conformational change in the receptor 
that promotes guanine nucleotide dissociation from the cytoplasmic 
GTP-binding protein Ga,; this represents the critical biochemical event 
initiating classical GPCR signal transduction (Fig. 1a)’. Activated B>- 
ARs are substrates for phosphorylation and binding of f-arrestins, 
events which inhibit interaction with G proteins and promote endocy- 
tosis of receptors via clathrin-coated pits (CCPs)'*!. Acute B,-AR Ga, 
signalling is thus traditionally thought to be restricted to the plasma 
membrane’*’*””. However, to our knowledge, this assumption has not 
been directly tested. To do so, we generated a biosensor of activated B- 
AR based on a conformation-specific single-domain camelid antibody 
(Nb80) used in recent structural studies'*!’. We reasoned that this 
nanobody, which selectively binds the agonist-occupied B.-AR and 
is able to stabilize an activated receptor conformation when present 
in vitro at high concentration, might act as a sensor of receptor activa- 
tion when expressed at relatively low concentration in intact cells 
(Fig. 1b). This proved to be the case; in cells maintained in the absence 
of agonist, Nb80 fused to enhanced green fluorescent protein (Nb80- 
GFP) localized to the cytoplasm and not with B,-ARs present in the 
plasma membrane (Fig. 1c, 0 min, top; Pearson’s coefficient = 0.135). 
Line scan analysis verified the cytoplasmic distribution of Nb80- 
GFP before B,-AR activation (Fig. 1d, top) as expected because the 


cytoplasmic concentration of Nb80-GFP achieved in our experiments 
(approximately 20 nM) was considerably lower than the equilibrium 
dissociation constant estimated in vitro for Nb80 binding to purified 
B>-ARs in the absence of agonist (0.76 + 0.14 uM; Supplementary Fig. 
la-d). After application of the adrenergic agonist isoprenaline (10 1M), 
Nb80-GFP was rapidly recruited to the plasma membrane and co- 
localized there with B.-ARs (Fig. 1c, middle; Pearson’s coefficient = 
0.625). Line scan analysis verified robust Nb80-GFP recruitment to 
the plasma membrane and concomitant depletion from the cytoplasm 
(Fig. 1d, middle), consistent with the much higher affinity of Nb80 for 
isoprenaline-activated B,-ARs (2.9 + 0.5 nM; Supplementary Fig. 1d). 
Agonist-induced membrane recruitment of Nb80-GFP was specific 
because the D1 dopamine receptor (DRD1), which is also G,-coupled 
but does not bind Nb80 in vitro (data not shown), failed to recruit 
Nb80-GFP to the plasma membrane in response to dopamine (10 11M) 
application (Supplementary Fig. 2). Furthermore, B.-AR-cyan fluor- 
escent protein (CFP) and Nb80-yellow fluorescent protein (YFP) gene- 
rated a pronounced fluorescence (Forster) resonance energy transfer 
(FRET) signal after isoprenaline application whereas DRD1-CFP did 
not (Supplementary Fig. 3a, b). 

B.-AR internalization began 1 to 2 min after Nb80-GFP recruit- 
ment to the plasma membrane, indicated by the emergence of surface- 
labelled B,-AR in peripheral cytoplasmic vesicles. Nb80-GFP did not 
co-localize with f-AR-containing endocytic vesicles upon first 
appearance (Fig. lc, middle, arrow in merged image points to an 
example) but was recruited at later time points (Fig. lc, bottom, 
Pearson’s coefficient = 0.702; examples are indicated by arrowheads). 
Endosome recruitment of Nb80-GFP was evident by line scan analysis 
(Fig. 1d, bottom; line scans are from the representative individual 
examples with further quantification in legend) and localized to 
EEAl-marked early endosomes (Pearson’s coefficient = 0.846; 
Supplementary Fig. 4) through which B,-ARs iteratively cycle in the 
presence of agonist*®. B,-AR-containing endosomes were initially 
devoid of Nb80-GFP and later acquired Nb80-GFP during their 
movement (Supplementary Videos 1 and 2). Interaction at endosomes 
was verified by B.- AR-CFP and Nb80-YFP normalized FRET (nFRET) 
(Supplementary Fig. 3c). These results suggest that B.-AR activation 
initiates a precisely choreographed series of events: Nb80-GFP is first 
recruited from the cytoplasm to the plasma membrane, then B,-ARs 
internalize devoid of Nb80-GFP, followed by a second phase of Nb80- 
GFP recruitment to the internalized B.-ARs. 

Nb80-GFP recruitment to endosomes required B,-ARs because a 
phosphorylation-deficient mutant version of the B,-AR ({,-AR-3S, 
with three serine mutations) that couples to Go, but is impaired in 
agonist-induced endocytosis” recruited Nb80-GFP to the plasma mem- 
brane but produced much less recruitment to endosomes (Fig. le, top). 
Nb80-GFP co-localized with B:-AR-3S after agonist-induced activation 
(Pearson’s coefficient = 0.674) but this was largely restricted to the 
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Figure 1 | Nb80-GFP detects activated B,-ARs in 
the plasma membrane and endosomes. a, The 
main events in B.-AR cAMP signalling include 
agonist binding (step 1), conformational activation 
of the receptor (step 2) that is coupled to 
conformational activation of G, (step 3) that 
produces guanine nucleotide exchange on G, and 
subsequent activation of adenylyl cyclase (AC) 
(step 4). b, Scheme for detecting conformational 
activation of B,-AR with Nb80-GFP. 

c, Representative Nb80-GFP (green) and B,-AR 
(red) localization at the indicated time (left) after 
10 uM isoprenaline addition (>30 Nb80-GEFP 
positive endosomes per cell observed at 20 min; 
n= 29 cells, 10 experiments). d, Representative 
individual Nb80-GFP line scans (shown at the 
same magnification as panel c). e, Representative 
Nb80-GFP (green) and B,-AR-3S (red) 
localization after 20 min of isoprenaline treatment 
(top) followed by reversal with 50 uM CGP-12177 
for the indicated times (6.4 Nb80-GFP positive 
endosomes per cell; n = 40 cells, 3 experiments). 
f, Representative individual Nb80-GFP line scans. 
g, Recovery of cytoplasmic Nb80-GFP 
fluorescence (black) or bleaching control of the 
plasma membrane f,-AR-3S (grey) 

(mean + s.e.m., m = 5 experiments). Scale bars, 

10 um. 


Figure 2 | Nb80-GFP accumulates on B,-AR- 
containing endosomes after their formation. 

a, B.-AR (red) and Nb80-GFP (green) at the 
indicated times after isoprenaline addition. Scale 
bar, 10 um. b, Average Nb80-GFP fluorescence 
measured in the TIRF illumination field at the 
indicated times (mean + s.e.m., 1 = 7 cells). 

c, TIRF image series showing B-AR (red) and 
Nb80-GFP (green) in sequential frames. 

d, Kymograph of an individual B,-AR-containing 
endosome (red, Alexa555) showing Nb80-GFP 
(green) acquisition over 4 min. e, Kymograph of 
Bo-AR (green, Alexa488) and Nb80-mRuby (red) 
over 6 min. 
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plasma membrane (line scan analysis in Fig. 1f is representative; further 
quantification in the figure legend). Nb80-GFP membrane recruitment 
was also reversible because the biosensor returned to a cytoplasmic 
distribution after addition of the competitive antagonist CGP-12177 
(Fig. le, f, middle and bottom rows; Pearson’s coefficient = 0.106), 
verified by recovery of Nb80-GFP fluorescence intensity in the cyto- 
plasm (Fig. 1g, see also Supplementary Video 3). Thus, Nb80-GFP 
detected activated B,-ARs both in the plasma membrane and endo- 
somes after acute agonist application. 

Discrete Nb80-GFP recruitment phases were clearly resolved by total 
internal reflection fluorescence (TIRF) microscopy that selectively detects 
events occurring in the plasma membrane and extending approximately 
100 nm into the peripheral cytoplasm~. First, within 2 min after agonist 
application, Nb80-GFP was progressively recruited to the plasma mem- 
brane in a diffuse distribution (Fig. 2a, compare first and second columns 
from left; quantification in Fig. 2b). Nb80-GFP did not co-localize with 
B2-ARs when they clustered in relatively static punctae characteristic of 
receptor-containing CCPs”* (Fig. 2a, third column). Second, over a period 
of several additional minutes, Nb80-—GFP was recruited to a discrete 
population of highly mobile punctae (Fig. 2a, fourth column) represent- 
ing peripheral §,-AR-containing endocytic vesicles”. 

Sequential TIRF imaging emphasized the distinction between rela- 
tively static B.-AR puncta not co-localizing with Nb80-GFP (Fig. 2c, 
arrowhead indicates an example) and mobile endosomes that did 
(Fig. 2c, arrow indicates a representative example; many are visible 
in Supplementary Video 4). Later recruitment of Nb80-GFP occurred 
rather suddenly, typically within approximately 5s (Supplementary 
Video 5), as also evident in kymographs of individual endosome trajec- 
tories (Fig. 2d). Ruling out potential artefacts of wavelength-dependent 
differences in TIRF microscopy illumination depth, later recruitment of 
the biosensor to endosomes was similarly observed when the excitation 
wavelengths used to detect receptor and biosensor were reversed (Fig. 2e). 

Nb80-GFP did not detectably concentrate in CCPs labelled with 
either of two independent markers, the adaptor protein B-arrestin-2 
(Fig. 3a and Supplementary Video 6) or the coat protein component 
clathrin light chain (Fig. 3b and Supplementary Video 7). Represen- 
tative examples are shown and this was verified across multiple cells 
and experiments (Pearson’s coefficient = 0.365 and 0.319, respect- 
ively, with numbers of replicates specified in the legend). In contrast, 
and as expected based on previous work, extensive co-localization 
of B.-AR with fB-arrestin was observed under the same conditions 
(Pearson’s coefficient = 0.677). Separation of Nb80-GFP localization 
from that of either B-arrestin or clathrin was clear in line scans (bottom 
panels of Fig. 3a, b were derived from the images shown and are 
representative, further quantification is in the legend). 

A simple interpretation is that Nb80-GFP associates with activated 
B2-ARs in the plasma membrane but then dissociates before receptors 
cluster in CCPs and internalize. This was surprising because B,-AR 
clustering in CCPs occurs much more rapidly” than the overall reversal 
rate of Nb80-GFP recruitment observed after agonist washout in 
intact cells (Fig. 1g), or the kinetics of Nb80 dissociation from purified 
B.-ARs in vitro (Supplementary Fig. 1b, e). One possibility is that 
Nb80-GFP dissociation is accelerated during the clustering process, 
by mechanisms such as receptor phosphorylation or steric exclusion 
mediated by f-arrestins or other CCP-associated components. An 
alternative possibility is that the fraction of $.-ARs that have bound 
Nb80-GFP in the plasma membrane by the time of the clustering 
reaction are unable to enter CCPs, and only those B-ARs not initially 
bound to Nb80-GFP in the plasma membrane are able to cluster in 
CCPs and subsequently internalize. In either case, the data clearly indi- 
cate that Nb80-GFP associates with B.-ARs after endocytosis, and after 
uncoating of the endocytic vesicle has occurred (Fig. 3c). Accordingly, 
Nb80-GFP recruitment to B,-AR-containing endosomes cannot repre- 
sent an artefact of persistent nanobody binding from the plasma 
membrane; instead, this observation reveals that B,-ARs present in 
early endosomes are in an activated conformation. 
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Figure 3 | Nb80-GFP does not accumulate in clathrin-coated pits or 
vesicles. a, Representative TIRF microscopy frames showing Nb80-GFP 
(green) and f-arrestin-2-mCherry (red) before and after agonist addition. 
Fluorescence intensity profiles are shown below for the indicated region and 
path; representative of n = 39 cells, 5 experiments and 4,849 punctae. 

b, Equivalent analysis comparing Nb80-GFP (green) to clathrin light chain- 
dsRed (red); representative of n = 26 cells, 3 experiments and 3,965 punctae. 
Arrowheads indicate examples of Nb80-GFP labelled endosomes. Scale bars, 
5 jum. c, Model for two phases of Nb80-GFP recruitment by the activated B.- 
AR, first at the plasma membrane and then at endosomes. 
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Because endosomes contain activated B,-ARs, we next asked if 
receptors engage their cognate G protein from this compartment. 
Heterotrimeric G proteins and adenylyl cyclase can be observed in 
endosomes as well as at the plasma membrane, supporting the concept 
of endosome-based G protein signalling**’°~*. To directly investigate 
the subcellular location of G protein activation, we developed a dis- 
tinct biosensor based on another nanobody, Nb37, which specifically 
recognizes the guanine-nucleotide-free form of Ga, representing the 
catalytic intermediate of G protein activation (Fig. 4a)”. We hoped 
that, because Nb37 binds a surface of the alpha-helical domain that is 
accessible only in the nucleotide-free form, we would be able to detect 
production of this critical but fleeting activation intermediate in living 
cells. This was indeed the case because Nb37-GFP localized in the 
cytoplasm of untreated cells and was rapidly recruited from the cyto- 
plasm to the plasma membrane in response to isoprenaline application 
(Fig. 4b, yellow arrowhead, Pearson coefficient = 0.627). Membrane 
recruitment of Nb37—GFP was agonist-dependent even in cells over- 
expressing Ga, and, notably, the cytoplasmic concentration of Nb37- 
GFP achieved in our experiments (also approximately 20 nM) was 
substantially lower than that producing detectable inhibition of G 
protein activation in vitro (Supplementary Fig. 5). Together, these 
observations suggest that Nb37-GFP indeed functions as a specific 
biosensor of G, activation under the experimental conditions used. 

Nb37-GFP was recruited not only to the plasma membrane (Fig. 4b, 
yellow arrowhead) but also to B,-AR-containing endosomes. Notably, 
endosome recruitment of Nb37-GFP occurred after the appearance of 
B2-ARs in the endosome. Such later recruitment was evident in dual 
label confocal image series (Fig. 4b shows an example: white arrowhead 
indicates recently internalized B.-AR, white arrows indicate Nb37-GFP 
recruitment, Pearson coefficient = 0.710; see also Supplementary Video 
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8). Nb37-GFP recruitment was specifically dependent on receptor 
activation because a mutant B-AR that is defective in G-protein coup- 
ling (B:-AR-Cys 341 Gly)** did not produce detectable recruitment 
(Supplementary Fig. 6), whereas the distinct G,-coupled DRD1 
recruited Nb37-GFP to both the plasma membrane and endo- 
somes (data not shown). Nb37-GFP localized uniformly on B-AR- 
containing endosomes in cells overexpressing Ga, (Fig. 4b and 
Supplementary Video 8) but was recruited non-uniformly in cells 
expressing endogenous levels (Fig. 4c, arrowhead indicates an 
example). Moreover, Nb37-GFP recruitment to endosomes had a 
scintillating appearance, fluctuating rapidly when viewed in live image 
series (Supplementary Video 9), suggesting that G protein activation 
detected by Nb37-GFP occurs dynamically from limited regions of the 
endosome membrane. 

We then asked if active B.-AR G, coupling from endosomes con- 
tributes to the cellular response, focusing on cAMP as a classical 
second messenger carrying the downstream signal, using a previously 
described real-time assay of cAMP accumulation in living cells and 
normalizing to the signal produced by receptor-independent activa- 
tion of adenylyl cyclase with forskolin (5 |1M)’’. Isoprenaline produced 
a rapid increase in cytoplasmic cAMP accumulation, reaching a maxi- 
mum within approximately 10 min (Fig. 4d, blue line). Dyngo-4a, a 
dynamin inhibitor that blocks B,-AR endocytosis by inhibiting CCP 
function*”’, did not detectably affect the earliest portion of the 
forskolin-normalized cAMP accumulation curve but significantly 
reduced its later rise (Fig. 4d, green line, grey dots indicate P values), 
consistent with the two phases of endosome-localized activation 
detected by the nanobody-derived biosensors. A similar inhibition 
was observed in cells expressing a mutant B,-AR with a carboxy- 
terminal alanine residue added that prevents efficient recycling 


Figure 4 | Internalized B,-ARs contribute to the 
acute cAMP response. a, Scheme for detecting 
conformational activation of G, with Nb37-GFP. 
b, Confocal image frames showing Nb37—GFP 
(green) and B,-AR (red) at the indicated time 
points after isoprenaline addition (representative 
of n = 14 cells; estimated Nb37-GFP recruitment 
delay ranged from 0.7 to 2.65 min). Scale bar, 5 jum. 
Yellow arrowhead indicates Nb37—GFP 
recruitment to the plasma membrane. White 
arrowhead indicates an endosome containing 
recently internalized B2-AR and not associated 
with Nb37-GEP, white arrows indicate Nb37—GFP 
recruitment. c, Representative confocal frames 
showing a discrete endosomal structure labelled 
with Nb37-GFP (green) and B,-AR (red) at higher 
magnification. Scale bar, 2 1m. Arrowhead 
indicates non-uniform localization of Nb37-GFP 
to the endosome. d, Forskolin-normalized f2-AR- 
mediated cAMP response in the absence or 
presence of 30 uM Dyngo-4a (mean + s.e.m., 

n= 10 experiments, P values in grey). 

e, Isoprenaline (20 min)-induced B,-AR and B3- 
AR-3S internalization measured by flow cytometry 
(n = 4 experiments). f, Forskolin-normalized B,- 
AR-3S-mediated cAMP response in the absence or 
presence of 30 1M Dyngo-4a (n = 8 experiments; 
P=0.1192). g, Model for two phases of B2-AR G, 
activation, first at the plasma membrane and then 
at endosomes, separated by the endocytic event. 
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(B2-AR-Ala)*® (Supplementary Fig. 7), suggesting that the Dyngo- 
sensitive component of the cAMP response did not represent a secon- 
dary consequence of resensitization by receptor recycling’’. Dyngo-4a 
did not produce the same effect on cAMP accumulation elicited by 
the internalization-defective B,-AR-3S mutant receptor’ (Fig. 4e, f), 
further supporting the conclusion that the second signalling phase 
indeed requires receptor localization in endosomes. 

The present findings revise a long-held tenet of molecular phar- 
macology, that acute signal transduction mediated by the canoni- 
cal B.-AR G, signal transduction mechanism is plasma membrane 
delimited. The results provide direct evidence supporting the hypo- 
thesis that B.-AR endocytosis contributes to a second phase of the 
acute cellular cAMP response, which represents a significant com- 
ponent of the overall biochemical signal developed within several 
minutes after the initial agonist application. Thus, although it remains 
clear that B,-ARs can elicit G,-mediated signal transduction from the 
plasma membrane, the present data reveal a discrete component of the 
acute signalling response that is initiated from endosomes (Fig. 4g). It 
remains unknown if B.-ARs are continuously bound by agonist in 
endosomes, as depicted in the figure for simplicity, but conforma- 
tional activation of G, in endosomes is both receptor- and agonist- 
dependent. Unambiguous detection of endosome-based activation of 
acute G-protein-linked signalling is presently limited to the B,-AR 
G, system for which the critical nanobodies are available. However, 
endosome-based contribution to the acute signalling response is pro- 
bably widespread in the GPCR superfamily because the B-AR belongs 
to the largest group (family A) of GPCRs and is often considered a 
prototype. We also suggest, more generally, that nanobody-based bio- 
sensors represent a versatile strategy for probing other types of dyna- 
mic conformational change with high spatiotemporal resolution in 
living cells. 


METHODS SUMMARY 


Experiments were carried out using human embryonal kidney HEK293 cells 
(ATCC) expressing the indicated receptor constructs and nanobodies fused to 
enhanced GFP. Optical imaging was carried out at 37 °C in DMEM not containing 
phenol red and supplemented with 30 mM HEPES. Live cell cAMP accumulation 
was assessed at 37 °C using pGlosensor-20F (Promega). Flow cytometry was 
carried out using Alexa647 (Invitrogen)-conjugated M1 anti-Flag monoclonal 
antibody (Sigma) and a FACSCalibur instrument (Becton Dickinson). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Cell Culture, cDNA constructs and transfection. HEK293 cells were grown in 
DMEM supplemented with 10% FBS (UCSF Cell Culture Facility) without anti- 
biotics. Stably transfected HEK293 cell clones expressing Flag-tagged B,-AR-3S 
were created using previously described Flag-tagged B.-AR”. A plasmid encoding 
a cyclic permuted luciferase reporter construct, based on a mutated RIIBB cAMP- 
binding domain from PKA (pGloSensor-20F, Promega). Nb80-eGFP and Nb37- 
eGEP were created by amplifying Nb80 and Nb37 nanobody complementary 
DNAs using 5'-CTTGAAAAGCTTGCCGCCACCATGGGACAGGTGCAGCT 
GCA-3'; 5'-TTCAAGGGATCCATGTGATGGTGATGGTGGTGTGAGGAGA 
CGGT-3' and 5’-CTTGAAAAGCTTGCCGCCACCATGGGACAGGTGCAGC 
TGCA-3'; 5'-TTCAAGGGATCCATGTGATGGGCTTCAGGTTCGTGATGG 
TGATG-3’ primers, respectively, and cloning into the peGFP-N1 vector using 
HindIII and BamH1. B-arrestin-2-GFP, clathrin-DsRed, EEA1—DsRed and Go,- 
HA were gifts from M. Caron, W. Almers, K. Mostov and P. Wedegaertner, 
respectively. B-arrestin-2-mCherry was generated by subcloning -arrestin-2 to 
pmCherry (Clontech) and f-AR-CFP was generated from the Flag-tagged B-AR 
construct. Transfections were performed using Lipofectamine 2000 (Invitrogen) 
according to the manufacturer’s instructions. Flag-tagged human B2-AR and f>- 
AR-3S (Ser 355 Gly, Ser 356 Gly and Ser 364 Gly were mutated simultaneously) 
constructs were labelled with Alexa555- or Alexa488-conjugated M1 anti-Flag 
monoclonal antibody (Sigma) as described previously”. 

Live-cell confocal imaging. Live cell imaging was carried out using Yokagawa 
CSU22 spinning disk confocal microscope with a X100, 1.4 numerical aperture, 
oil objective and a CO, and 37 °C temperature-controlled incubator. A 488 nm 
argon laser and a 568 nm argon/krypton laser (Melles Griot) were used as light 
sources for imaging GFP and Flag signals, respectively. Cells expressing both Flag- 
tagged receptor and the indicated nanobody-GFP were plated onto glass cover- 
slips. Receptors were surface labelled by addition of M1 anti-Flag antibody 
(1:1,000, Sigma) conjugated to Alexa 555 (A10470, Invitrogen) to the media for 
30 min, as described previously”’. Indicated agonist (isoprenaline, Sigma) or antago- 
nist (CGP-12177, Tocris) were added and cells were imaged every 3 s for 20 min in 
DMEM without phenol red supplemented with 30 mM HEPES, pH 7.4 (UCSF Cell 
Culture Facility). Time-lapse images were acquired with a Cascade II EM charge- 
coupled-device (CCD) camera (Photometrics) driven by Micro-Manager 1.4 
(http://www.micro-manager.org). 

Live cell TIRF microscopy. TIRF imaging was carried out as described 
previously*’. Briefly, HEK293 cells co-expressing either Nb80-eGFP and 
f-arrestin-2-mCherry or clathrin light chain-DsRed, were imaged in DMEM 
without phenol red supplemented with 30mM HEPES, pH7.4 (UCSF Cell 
Culture Facility). Imaging was carried out using a Nikon TE-2000E inverted 
microscope with a X100, 1.49 numerical aperture TIRF objective, equipped for 
through-the-objective TIRF illumination, a 37°C temperature-controlled stage 
(Bioscience Tools) and an objective warmer (Bioptechs). A 488 nm argon laser 
(Melles Griot) and a 543 nm helium-neon laser (Spectra Physics) were used as light 
sources. Time-lapse sequences were acquired with a C9100-12 camera 
(Hamamatsu Photonics) driven by iQ software (Andor). Cells were imaged every 
3s for 20 min. 

Image analysis and statistical analysis. Images were saved as 16-bit TIFF files. 
Quantitative image analysis was carried out on unprocessed images using ImageJ 
software (http://rsb.info.nih.gov/ij). Co-localization analysis was estimated by cal- 
culating the Pearson’s correlation coefficient between the indicated image chan- 
nels using the co-localization plug-in for Image]. Analysis of Nb80-GFP intensity 
profile along the straight line and Nb80-GFP/f-arrestin or Nb80-GFP/clathrin 
along the segmented line were carried out using the ImageJ plot profile function. 
For estimating changes in Nb80-GFP surface fluorescence over time in TIRF 
images, individual cells were selected manually and fluorescence values measured 
over the entire stack. A blank area of the image lacking cells was used to estimate 
background fluorescence. Average fluorescence intensity was measured in each 
frame, background-subtracted and normalized to the maximum value. P values are 
from one-tailed unpaired Student’s t-tests. For visual presentation (but not quanti- 
tative analysis), image series were processed using Kalman stack filter in Image]. 
Luminescence-based rapid cAMP assay. HEK293 cells were transfected with a 
plasmid encoding a cyclic-permuted luciferase reporter construct, based on a 
mutated RIBB cAMP-binding domain from PKA (pGloSensor-20F, Promega), 
which produces rapid and reversible cAMP-dependent activation of luciferase 
activity in intact cells. Cells were plated in 24-well dishes containing approximately 
200,000 cells per well in 500 pl DMEM without phenol red and no serum and 
equilibrated to 37 °C in a light-proof cabinet. An image of the plate was focused 
ona 512 X 512 pixel electron multiplying CCD sensor (Hamamatsu C9100-13), 
cells were equilibrated for 1h in the presence of 250 1g ml * luciferin (Biogold), 
and sequential luminescence images were collected every 10s to obtain basal 
luminescence values. The camera shutter was closed, the cabinet opened and 
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the indicated concentration of isoprenaline was bath applied, with gentle manual 
rocking before replacing in the dark cabinet and resuming luminescence image 
acquisition. In endocytic manipulation experiments, cells were pre-incubated 
with 30 4M Dyngo-4a (abcam Biochemicals) for 15 min. Every 10s, sequential 
images were acquired using Micro-Manager (http://www.micro-manager.org) 
and integrated luminescence intensity detected from each well was calculated after 
background subtraction and correction for vignetting using scripts written in 
MATLAB (MathWorks). In each multiwell plate, and for each experimental con- 
dition, a reference value of luminescence was measured in the presence of 5 1M 
forskolin, a manipulation that stimulates a moderate amount of receptor-inde- 
pendent activation of adenylyl cyclase. The average luminescence value—mea- 
sured across duplicate wells—was normalized to the maximum luminescence 
value measured in the presence of 5 tM forskolin. 

FRET imaging. FRET imaging was carried out as described previously’®. Briefly, 
HEK293 cells co-expressing §2-AR-CFP or DRD1-CFP and Nb80-YFP were 
imaged in wide field at 37 °C using a shuttered mercury arc lamp and standard 
CFP excitation (ET430/24X) and YFP emission (ET500/20) band pass filters 
(Chroma). YFP emission was collected using a 535/30 m filter, and CFP emission 
was collected through a 470/24 m filter. Corrected FRET ratios were obtained 
using the following equation: NFRET = [(Ipger — BGgrer) — (Icrp — BGcrp) X 
BTponor ~ (lyrp—BGyrp) * DEacceprorl/Icep- BTponor, donor bleed 
through; DEaccepror, direct excitation of the acceptor; BGx, background fluor- 
escence; and Ix, integrated fluorescence intensity measured in a given channel. 
Flow cytometric assay of receptor endocytosis. Surface fluorescence of Flag-B,- 
AR or Flag-B2-AR-3S expressing HEK293 cells was used to measure receptor 
endocytosis. Cells were incubated with 101M isoprenaline for 20 min at 37 °C 
to drive receptor internalization to steady state and were subsequently rinsed 3 
times with ice-cold PBS, then mechanically lifted and incubated with lpg ml} 
Alexa647 (Invitrogen)-conjugated M1 anti-Flag monoclonal antibody (Sigma) 
at 4°C for 1h. Mean fluorescence intensity of 10,000 cells was measured using 
a FACSCalibur instrument (Becton Dickinson). Each condition was performed 
in triplicate. 

Enhanced GFP calibration. Recombinant eGFP (BioVision) was used for cal- 
ibrating average fluorescence intensity of the biosensors, imaged in confocal 
optical sections through the cytoplasm of cells not exposed to agonist (to achieve 
diffuse cytoplasmic distribution of the biosensors). eGFP was diluted in Hank’s 
balanced salt solution and confocal sections were imaged through droplets of each 
using the same illumination and acquisition parameters as for imaging the bio- 
sensors in cells. For each cell, a background fluorescence value was determined 
by average fluorescence intensity of a blank region in the same image. The cyto- 
plasmic concentration of biosensors was estimated by interpolation of the back- 
ground-subtracted value using a linear least-squares fit to the standard plot. 
Generation of B,-AR-rHDL nanoparticles. Apolipoprotein-AI (Apo-AI) was 
biotinylated using NHS-PEG4-biotin (Pierce Biotechnology) at a 1:1 molar ratio. 
Following a 30-min biotinylation reaction at room temperature, the sample was 
dialysed to remove free biotin. Flag-tagged $2-AR was incorporated into recom- 
binant high density lipoprotein (rHDL) particles as previously described*** using 
biotinylated Apo-AI. Receptor-containing particles were then purified by M1 
anti-Flag immunoaffinity chromatography”*. Particles containing purified mono- 
bromobimane-labelled B,-AR were generated similarly except not using biotiny- 
lated HDL, with receptor labelling and fluorescence analysis carried out as 
described previously"®. 

Assessing Nb80 binding to immobilized B,AR-rHDL. Nb80 binding to unli- 
ganded and agonist-occupied B,-AR was measured using the OctetRED biolayer 
interferometry system (Pall FortéBio). In this assay, a target protein is immobilized 
on the functionalized tip of a fibre optic probe that is dipped into an analyte 
solution to observe analyte association to the target protein. A dissociation step 
is then performed by transferring the biosensor into buffer lacking analyte. 
Analyte association/dissociation is measured by monitoring changes in the inter- 
ference pattern of a light beam reflected from the biosensor tip as the total mass 
bound at the tip surface changes*’. Streptavidin-coated biosensors (Pall FortéBio) 
were loaded with biotinylated B.-AR-rHDL particles for 15 min at room temper- 
ature and the biosensors were transferred to the OctetRED instrument. Sensors 
were placed into assay buffer (20mM HEPES, pH7.7, 100mM NaCl, 1mM 
EDTA, 0.02% (w/v) ascorbic acid, 0.05% (w/v) BSA) with or without 100 uM 
isoprenaline for 30 min. To measure Nb80 association, the sensor was transferred 
to assay buffer with Nb80 (at indicated concentrations) for 5 min, followed by a 
30 min dissociation step in assay buffer. Isoprenaline (100 1M) was included in the 
association and dissociation steps when measuring Nb80 binding to agonist-occu- 
pied receptor. All experiments were carried out at 25°C with the assay plate 
shaking at 1,000r.p.m. Buffer-only controls were included in each experiment 
to monitor for baseline drift, and nonspecific Nb80 binding was measured in a 
parallel assay using sensors loaded with empty rHDL particles. Raw data were 
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processed to remove baseline and nonspecific binding using Octet Data Ana- 
lysis 7.0 software (Pall FortéBio) and exported to Prism 5 (GraphPad) for curve 
fitting. All association and dissociation curves were fit using a single-phase 
exponential association or decay curves, respectively. Equilibrium binding 
affinity of Nb80 for B.-AR in the presence or absence of the agonist isoprenaline 
was assessed by monitoring the maximal interference shift generated by Nb80 
binding (at varying Nb80 concentrations) to the probe containing B)-AR recon- 
stituted in rHDL. The maximal shift was plotted against the Nb80 concen- 
tration and fitted by nonlinear regression in Prism 5 (GraphPad) to generate 
the apparent affinity. 

Inhibition of bodipy-GTPyS-FL binding by Nb37. The effect of Nb37 on GTP 
loading of purified G proteins was measured using 100nM bodipy-GTPyS-FL 
(Invitrogen) essentially as described”. In this assay we used the fluorescence 
emission of bodipy-GTPyS-FL (A.,.~470 nm, Aem~515 nm) that accompanies 
binding of the labelled nucleotide to G protein’*. Briefly, the fluorescence of 
100 nM bodipy-GTPyS-FL was measured in the presence of 1 mM of the indicated 
G protein using a 96-well microtitre plate format on a M5 fluorescence plate reader 
(Molecular Devices). Nb37 was added together with bodipy-GTP7, -FL and the 
binding reaction was initiated by the addition of G protein (1 mM) in 20 mM Tris- 
HCl, pH8.0, 3mM MgCl, 1mM dithiothreitol in a final volume of 200 ml. 
Bodipy-GTPyS-FL binding to heterotrimeric G protein included 0.1% dodecyl- 
maltoside (final). Ga, was purified as described*’. Ga,By was purified as 
described'*. Myristoylated Go; was purified as described**. The time scans were 
limited to 240s to minimize the accumulation of hydrolysis of the product of 
bodipy-GTPyS-FL, bodipy-phosphate"’. 
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Open to possibilities 


Opting for open access means considering costs, journal prestige and career implications. 


BY STEPHEN PINCOCK 


ot so long ago, Mike Taylor gave no 

| \ | thought to open access when submit- 
ting research papers for publication. “I 

just sent them to the well-respected journals,” 
says Taylor, a palaeontologist at the University 
of Bristol, UK. “It’s only gradually, really, that 
it became apparent to me how stupid that was.’ 
As he started to think more carefully about 
where he published his work, his contempla- 
tion turned into fully fledged advocacy. Taylor 
would like to see free, unrestricted access to 
all scholarly papers online. In an article on the 
website of UK newspaper The Guardian this 


January, he argued that hiding publications 
behind a paywall is “immoral” (see go.nature. 
com/v9fmtm). But within hours of appearing, 
the article was accumulating comments that 
laid out caveats that he had not considered. 
The point that raised the most ire was Tay- 
lor’s argument that scientists no longer need to 
publish in prestigious journals to boost their 
careers, which commenters vigorously refuted. 
Taylor was forced to rethink his position on 


THE FUTURE OF PUBLISHING 


A Nature special issue. 


that: journal prestige remains important, he 
realized. “I got quite a lot of criticism from 
people I respect a lot,” he says. 

Early-career scientists face a pressing ques- 
tion: how should they publish to advance their 
careers at a time when the scholarly world is 
being shaken up? New options in publishing 
have highlighted sticky debates related to the 
impacts of costs on individual researchers and 
the ethics of business models that can keep 
cash-poor scientists from accessing data. But 
many researchers contend that impact factors 
and other metrics of journal prestige remain 
crucial — for now. 

Over the past 20 years, open-access > 
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> publishing has become a major part of 
the scholarly landscape. It is now common in 
astronomy, maths and physics, where most 
researchers submit their work to the open- 
access repository arXiv.org before it is pub- 
lished, and is on the rise in the life sciences and 
other fields. Over the past decade, open-access 
publishing has increased its share of articles by 
about 1% a year (see page 425). Around 17% 
of the 1.66 million articles indexed by abstract 
and citation database Scopus in 2011 were 
freely available from journal publishers’. 

Worldwide, more than 200 institutions and 
80 research funders require their researchers’ 
work to be open access, according to the Roar- 
map registry (roarmap.eprints.org). For exam- 
ple, from 1 April, researchers supported by any 
of the seven UK research councils will be asked 
to publish their work in a journal that either 
provides immediate and unrestricted access 
to the final published version of the paper, or 
consents to the manuscript being deposited 
in an open-access repository within a certain 
time — six months for science papers. The 
US National Institutes of Health requires that 
scientists submit final peer-reviewed journal 
manuscripts arising from agency-funded work 
to the digital archive PubMed Central, and that 
those papers are made available to the public 
within one year of publication. 


THE POWER OF PRESTIGE 
Not everyone shares Taylor’s moral outrage 
over the need for open access. Many senior 
researchers have simple advice, especially for 
early-career scientists: go to the best journal 
you can publish in. Rob Brooks, an evolutionary 
scientist at the University of New South Wales in 
Sydney, Australia, supports open access in prin- 
ciple, but says that career building still relies on 
established models of prestige. “Journal quality 
remains the benchmark for that piece of work 
and that’s what people will be assessed by,’ he 
says. “Impact factors still pretty much rule. A lot 
of people — grant committees, administrators 
and even referees — can't assess quality. All they 
can do is count or pseudo-quantify. They count 
the number of papers you've got and count the 
impact factors of the papers and make a little 
metric, rather than just reading the papers.” 
Early-career scientists should be wary about 
turning away from high-impact journals to 
publish in open-access outlets with lesser 
reputations, says Chris Chambers, a neuro- 
scientist at Cardiff University, UK, and an aca- 
demic editor at PLoS ONE, who was among 
those who critiqued Taylor’s post. “On the one 
hand, this is a noble act that is impossible not 
to applaud,” he says. “On the other hand, there 
is a risk that such researchers deal themselves 
out of the game, being overtaken in the race to 
senior positions by more careerist colleagues.” 
But going for high-impact journals need not 
rule out open-access, says Peter Suber, director 
of the open-access project at Harvard Univer- 
sity in Cambridge, Massachusetts. He suggests 


looking through the Directory of Open Access 
Journals (www.doaj.org) for those with edi- 
tors and published authors who are familiar 
and respected in the field. Suber also points 
out that publishing in a subscription journal 
does not mean that an article cannot be made 
freely available online later; most allow pre- or 
post-print archiving in open-access reposito- 
ries such as those listed at www.opendoar.org. 
The open-access journal eLife published its 
first articles only last October, but can already 
boast a measure of prestige. Although it is too 
early to gauge its impact factor, the journal 
seems likely to benefit from the reputation of 
its founders, research funders the Wellcome 
Trust in London, the Howard Hughes Medi- 
cal Institute in Chevy Chase, Maryland, and 
the Max Planck Society in Munich, Germany. 
“eLife is ... an obvious example of a brand-new 
journal with no branding, no standing in the 
world? says Peter Binfield, publisher of the sep- 
arate open-access journal Peer], based in San 
Francisco, California, “and yet it’s clear that this 
is a good place to publish, because it’s got some 
big-name backers.” 
Although the 
established hierar- 
chies of prestige may 
still hold sway in 
academia, the land- 
scape is shifting, says 
Robert Kiley, head 
of digital services 
at the Wellcome 
Trust Library. Many 
funders are looking 


“If you come to beyond a journal’s 
the Wellcome f Or brand name. “If you 
agre ant, funding come to Wellcome 
decisions are for a grant,” he says, 
based onthe “we make it clear that 
intrinsic merit funding decisions are 
of the work, and __ based on the intrinsic 
not the title of merit of the work, 
the journal.” and not the title of 
Robert Kiley the journal in which 


an author’s work is 
published.” Kiley points to the policies of the 
UK programme for assessing research quality, 
the Research Excellence Framework, which 
stated in July 2012 that no grant-review sub- 
panel “will make any use of journal impact fac- 
tors, rankings, lists or the perceived standing of 
publishers in assessing the quality of research 
outputs” (see go.nature.com/xgsses). 

In principle, open-access publishing can 
connect researchers to a wider readership. 
Some studies suggest that articles in open- 
access journals or repositories are accessed 
more often and reach a broader audience than 
those in subscription-only journals. Whether 
this translates into higher citation rates is up for 
debate. In 2010, a meta-analysis found 27 stud- 
ies showing that open-access articles had more 
citations than papers behind paywalls — up 
to 600% more, depending on the field — and 
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four that found no open-access advantage’. 

Some journals, including PLoS ONE and 
Peer], tally metrics such as the number of peo- 
ple who have clicked on each paper, and the 
paper's visibility on social-media websites such 
as Twitter and Facebook. Ross Mounce, a pal- 
aeontology PhD student at the University of 
Bath, UK, thinks that such alternative metrics 
are already beginning to free researchers from 
conventional measures of prestige. “I think if 
you have faith in your own work, then you can 
publish it wherever,” he says, countering com- 
ments on Taylor's article. “And as long as people 
can access it and read about it and know about 
it, and can discover your work, then that’s all 
you need.” 


WEIGHING THE COST 

Many researchers contemplating open access 
are concerned about having to pay publication 
costs. As of August 2011, about one-quarter 
of open-access journals said that they charged 
article-processing fees; in 2010, those fees 
ranged between US$8 and $3,900, with the 
average around $900 (ref. 3). 

According to Carl Bergstrom, an evolution- 
ary biologist at the University of Washington 
in Seattle, there tends to be a positive correla- 
tion between an open-access journal's fees and 
its score in a system he co-developed that rates 
journals according to the number of citations 
they receive, with citations from highly ranked 
journals weighted more heavily. However, not 
all expensive journals have high ratings; an 
interactive tool shows that there is a great deal 
of variation (www.eigenfactor.org/openaccess). 

In any case, Suber notes that only a little over 
10% of article-processing fees are paid out of 
pocket by authors*. “Most of the time, those fees 
are paid on behalf of the author by the author's 
employer or the author's funding agency,’ he 
says. Scientists should check with their fund- 
ing agencies directly, but open-access publisher 
BioMed Central keeps an incomplete list of 
those that will pay (see go.nature.com/qpjxoi). 
And some publishers reduce or waive fees for 
researchers who cannot afford to pay. 

Peer], which published its first papers last 
month, has taken a completely new approach 
to cost. Rather than charging a fee per paper, it 
offers lifetime membership, ranging from $99 
for one publication per year to $299 for unlim- 
ited publications (see Nature 486, 166; 2012). All 
memberships include the right to post articles 
on a non-peer-reviewed preprint server. For a 
paper to be published, all authors must be mem- 
bers (or at least 12 authors must be, for papers 
with more). “We've deliberately tried to strip 
out all the extraneous costs from the system to 
make it as efficient and cheap as possible froma 
researcher's point of view,’ says Binfield. “Then 
we've layered on this different business model, 
which is a payment per person per membership 
rather than payment per publication” 

Bo-Christer Bjork, an information systems 
scientist at Hanken School of Economics in 


WELLCOME LIBRARY/WELLCOME IMAGES 


Helsinki, says that the number of subscrip- 
tion journals offering open access for a fee 
has doubled in recent years, and currently 
stands at more than 4,300. However, just 
1-2% of eligible authors take up that open- 
access option’. Bjork argues that the high 
costs are a deterrent — about $3,000 is typi- 
cal. “There are, however, a few publishers 
and individual journals with a much higher 
uptake,” he says. For example, Nature Com- 
munications, which was launched in 2010 
by Nature Publishing Group (NPG), is a 
multidisciplinary journal that takes this 
hybrid approach. When it started, about 
half of authors chose the open-access 
option. But those numbers have fluctu- 
ated, says James Butcher, associate director 
of open publishing at NPG, and in the past 
six months about 30% have chosen open 
access. Butcher has not collected data to 
explain the trend, but speculates that it 
might be attributable to the journal's rela- 
tively high fees of around $5,000, to chang- 
ing author demographics or to a general 
drop in interest after early excitement about 
the open-access option. 

Chambers suggests that paying a fee to 
publish in a hybrid journal is a good way to 
achieve both accessibility and prestige. “The 
key factor is whether the researcher has 
the funds available to do so, and whether 
the funding agency requires papers to be 
open access,” he says. But where funds have 
not been set aside for article-processing 
charges, and young scientists would need to 
spend research money, Chambers advises 
against consenting to high publishing fees. 

“As a young investigator you have to do 
what’s economically viable,’ says Stephen 
Macknik, a neuroscientist at the Barrow 
Neurological Institute in Phoenix, Ari- 
zona. Paying an article-processing charge 
for a reputable open-access journal may be 
a good middle ground for young research- 
ers, he says. 

But scientists shouldn't sacrifice funding 
that was meant for research. “To maximize 
their competitiveness it is vital that young 
researchers maintain a productive profile of 
high-quality research, and this means using 
research funds to do as much high-quality 
research as possible,’ says Chambers. “It 
falls to the more senior scientists to change 
the system? m 


Stephen Pincock is a freelance writer 
based in Sydney, Australia. 
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TURNING POINT 
Kate Rubins 


After 15 years in infectious-disease biology, 
Kate Rubins jumped at the chance to fulfil her 
childhood dream of becoming an astronaut. 


Did you always want to become an astronaut? 
As a kid, I really did, but various people 
pointed out that it was not the most realistic 
career choice. When I was 16, my dad took 
me toa DNA conference at the Exploratorium 
science museum in San Francisco, California, 
and I was captivated by this way of looking at 
biology and by the discussions of bits of nucleic 
acid that could make us sick. 


How did you come to focus on research related 
to public health? 

As an undergraduate majoring in biology at the 
University of California, San Diego, I worked on 
infectious diseases at the nearby Salk Institute 
for Biological Studies. I decided to do gradu- 
ate studies in virology at Stanford University in 
California because it had a hospital, which made 
working on clinical applications easier. I was 
looking at immune responses related to small- 
pox and Ebola, so I flew to Maryland every few 
weeks to work in a biosafety-level-4 lab, which 
handles the most dangerous microbes. Then I 
shipped the data back to Stanford. 


You built a lab quickly after your PhD. How? 

I decided to skip the postdoc. The White- 
head Institute for Biomedical Research in 
Cambridge, Massachusetts, had a fellows 
programme that was akin to a junior faculty 
position with few teaching responsibilities. 
That seemed to bea good fit. My interests had 
shifted to the genomics of infectious disease, 
and I started laying the groundwork to study 
monkeypox infections in the Democratic 
Republic of the Congo. With a lot of hubris, 
I started my own lab. It was amazing — the 
Whitehead gave fellows a lot of leeway. In 
three years, I had secured enough money 
from the US National Institutes of Health 
and the departments of defence and home- 
land security to increase the lab to 14 people. 


Why did you apply to become an astronaut? 

It was one of those childhood dreams that I 
couldnt let go of. I thought that NASA didn't 
take biologists and so nothing would come of 
it, but I knew I would regret it if I did not apply. 


How did it feel to have to dismantle your lab 
after NASA accepted you? 

Joining NASA was very exciting, but it was 
the hardest decision I have had to make in my 
life. [had been working towards one goal for 


more than 15 years. I had been very specific 
about what I wanted to do with my career, and 
this was completely different. I was concerned 
about my lab members— wonderful people 
whom I couldn't leave high and dry. I wanted 
to make sure that they were able to continue 
their research. In the six weeks that I was given 
to shut down my lab before heading to the 
Johnson Space Center in Houston, Texas, for 
training, I found good labs for everyone to join. 


Three years later, do you still feel that joining 
NASA was the right decision? 

Yes; I am really happy. I have learned a whole 
bunch of new skills, including how to speak 
Russian, conduct a space walk and fly a super- 
sonic jet. The whole time at NASA has been a 
huge turning point for me. 


Are you able to do research? 

Yes, but it is different from having my own lab. I 
dontt get to say, “I want to do this”; they select the 
best peer-reviewed research. I bring an opera- 
tional perspective to the experiments that match 
up with my expertise. My time in biosafety labs 
taught me to work in a high-pressure environ- 
ment, which provided skills that Iam using 
at NASA. Iam working on experiments from 
immunology to bone loss in microgravity. 


Will you go into space? 

Fingers crossed. I am in the newest class of 
astronauts. The International Space Station will 
be operational until 2020, and perhaps 2028, so 
there is a chance. NASA is also building a space- 
launch system to go beyond low-Earth orbit. 
Whether either of those overlaps with my time 
frame is unknown, but it would be fantastic. 
I will go wherever NASA sends me. 


INTERVIEW BY VIRGINIA GEWIN 
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Ua SCIENCE FICTION 


BY GEORGE ZEBROWSKI 


nable to reach Felix by phone, Bruno 

| hobbled across the street to his 

physicist friend’s house and waved 

the insulting cheque at him. “What’s the 
joke, Felix?” 

Long retired, Bruno still recalled the joys 
of plumbing: the craft of figuring pipes, 
often well beyond necessity, by eye alone, 
in quality iron, copper or even plastic 
connections. It had been a while since 
anyone had asked him to do repairs, 
or even make a job estimate. But nowa 
cheque had arrived, with a post-it “thank 
you” from his lifelong friend attached to the 
return invoice. 

Felix gave him a confused look. 
“But... but you did the job while I was away.” 

“T did not; Bruno said. 

“Have you ... forgotten?” Felix asked the 
sudden stranger. 

“T didn’t send this invoice,’ Bruno said. 

“Tl see about it,’ Felix said coldly, closing 
the door. 

A month later, the scene played out again 
at Felix’s door, with Bruno waving invoice 
and cheque for bathtub fixtures he had not 
installed. 

A third time, Felix said more warmly, “No 
denying these invoices, with our signatures 
on each. So whos joshing us, Bruno?” 

“Maybe it’s you,’ Bruno said. 

“Not you, Bruno?” Felix asked. “Did we 
both forget?” 

Bruno rubbed his bad knee and leaned 
back on the sofa as they stared at the paper- 
work on the coffee table in front of them. 

“You know, Bruno,’ Felix said, “’'m only a 
so-so physicist, but I have a wild idea about 
all this?” 

“Really?” 

“Youre slipping, Bruno,” he said. 

“Slipping?” 

“Through your sum over histories.” 

“My what?” 

“Across the probabilities. Some version of 
you did all this work” 

“And you paid me,” Bruno said, saddened 
by his friend’s covering up what might be 
signs of delusion and mental decline. 

“We can check it out,” Felix said. 

They went to his bathroom and looked 

under the sink, where 
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STICKY 


Go with the flow. 


then at the immaculate workmanship in the 
shower and bath. Felix leaned down under 
the sink and fondled the drain and water- 
supply connections. “Ah, yes, nice and dry,” 
he said happily, looking around at the floor. 
“But look here; he said suddenly, and picked 
up a long grey hair from the floor. “I'm sure 
this is yours!” He held the hair up to the light 
as he stood up and said, “You know, Bruno, if 
this proves out, you'll be famous in physics, 
more than I could ever be. It’s classic Feyn- 
man and Schrédinger” 

“Ts one hair any proof?” asked Bruno. 

“One is enough... and I'll do the math- 
ematics,” Felix said. 

“Maybe there'll be another invoice,” said 
Bruno warily. 

Felix sighed. “Yes, a paper with all the 
right maths and one hair won't do. It could 
have got here a number of ways, and the rest 
is words. We'll see if it happens again, but 
even that won't be enough” 

“There’s not much we can do at the 
moment,’ Bruno said, hoping that a delay 
might rid his overworked friend of delusions. 

“Moments...” Felix mused with a glint 
in his eyes. “Think of all the infinite diverg- 
ing moments, each one going its own way 
at every juncture, every minute of the day, 
at every fraction of a second, throughout all 
the galaxies!” 

“But why — why would all this be hap- 
pening... to us?” Bruno asked, humouring 
his friend. 
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“Why does anything happen in the quan- 
tum realm? No one knows why, except that 
it does and can be described, even predicted 
and made useful in technologies. To ask why 
seems a useless question.” 

Bruno asked: “But why are we able to 
notice it?” 

‘A slip of some kind,” Felix said. 
“But this ... this spilling up from the 
small to the large...” 
“The small is affecting the large” 
“Are these divergences common?” 
Bruno asked. 
“They might be finite sets.” 
Demoralized, Bruno went home and 
slept badly, dreaming of the objects in his 
friend’s realm of work — a vast grey infinity 
of spaceless distances and winking points far 
removed from the undersides of sinks. He 
woke up thinking that the one domain could 
have nothing to do with the other, despite 
the fact that the visible world depended on 
an invisible realm. 

He got up and worked on the page proofs 
of the new edition of his physics text, which 
suddenly looked unfamiliar, as if someone 
else had written it; but finally he settled 
into its long-worked-over familiarities. 
“Probabilities are like the thin pages of an 
infinite book,” he wrote in the margin of 
the manuscript, “and sometimes they stick 
together, even flow into each other. This 
may very well be the reality that makes 
human choices possible, escaping a hard 
determinism through a form of limited 
chaos...” 

Clever thought, he told himself. 

Sticky stuff. 

Some world pages stuck, others didn't. 
Smiling, he picked up a book and riffled the 
pages, imagining plumbing analogies to the 
structure of the Universe... 

There was an annoying knock on his door, 
and Bruno resented the early interruption. 

He opened the door, and saw Felix, his old 
school friend, with toolbox in hand. 

“Well, here I am,” Felix said with a smile, 
looking happy. “Hope you didn’t break your 
kitchen sink just to get me to visit” 

Bruno was puzzled for an instant, then 
stepped aside as the plumber hobbled in. = 


George Zebrowski is an award winning 
novel and story writer, essayist, poet, editor 
and lecturer. His latest novel is Empties 
(Golden Gryphon Press), and with Gregory 
Benford he is editor of Sentinels In Honor of 
Arthur C. Clarke (Hadley Rille Books). 
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BRIEF COMMUNICATIONS ARISING 


Is the ocean food provision index biased? 


ARISING FROM B. S. Halpern et a/, Nature 488, 615-620 (2012) 


How close to maximum sustainable food provision is current seafood 
harvest from the world’s oceans? Halpern et al.' suggest that the 
answer is 25% from a global index of food provision, part of their 
multifaceted index of ocean health. Rigorous methods used for 
management, however, demonstrate that their food provision index 
is uncorrelated with actual food provision, and that global ocean food 
provision is in the range of 71-95%. Their results stem from an 
uncertain method of estimating maximum sustainable yield (MSY), 
and we believe that this approach should be avoided as a measure of 
food provision. There is a Reply to this Brief Communication Arising 
by Halpern, B. S. et al. Nature 495, http://dx.doi.org/10.1038/nature11975 
(2013). 

The first indication that the Halpern et al.' food provision index 
may be unreliable comes from examining the scores of individual 
countries whose fisheries performance is well understood. It is widely 
acknowledged that fisheries in New Zealand and the USA are well 
managed*” whereas those in Europe are not**. However, New Zealand 
has an index value of 16% and the USA 25%, whereas Europe includes 
four of the six highest values, topped by France (87%) and Spain (60%). 

We tested the reliability of their index in a number of ways. 
Crudely, their index should be about the same as dividing global 
catches by global MSY. Recent estimates of global MSY are 83-99 
million t (ref. 5), 89.5-100.9 million t (ref. 6) and 85-110 million t 
(ref. 7), which are similar to older estimates that are often rounded to 
100 million t (refs 8, 9). Dividing current catch of 78.6 million t (ref. 10) 
by MSY yields estimates for current food provision of 71-95%. 

The underlying basis for the Halpern et al.’ food provision index 
(setting aside modifications for trends, pressures and taxonomic 
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Figure 1 | Contours of the Halpern et al.' food provision index (0-100%) in 
relation to current catch and maximum catch ( C,,a,). The Halpern index is 
100% when catch is between 71% and 79% of assumed MSY, and 0% when 
catch is either zero or greater than 150% of assumed MSY. MSY is assumed to 
equal Cinax’”, based ona highly uncertain relation between MSY and Cynax. For 
large fisheries, the food provision index is zero whenever current catches are 
more than 50% of maximum catches. 


groupings) involves comparing current catches to estimated multi- 
species MSY as estimated from a regression of MSY against maximum 
catch (see Fig. 1 for details). This regression is highly uncertain: the 
ratio of their predicted MSY to actual MSY spans a 50-fold range (95% 
CI, 0.10-5.02). This is why MSY is normally estimated from biomass 
data using stock assessments. Stock assessment estimates can be used 
to ground-truth the Halpern et al. catch-based estimates of MSY. For 
example, when their method is applied to all assessed tuna and billfish 
fisheries, their predicted MSY is lower than assessment MSY values 
for the 29 (out of 36) largest fisheries, and summed predicted MSY is 
only 41% of summed biomass-estimated MSY. When these estimates 
are fed into their rule for calculating food provision (which is zero 
when catches exceed 1.5 X MSY), their food provision index for tuna 
and billfish species is 0%. By contrast, the MSY estimates from the 
assessments used to manage these tuna and billfish species result in an 
estimate of food provision of 94% (summed catches divided by 
summed MSY). 

For a more rigorous examination, we compared sustainable yield at 
a given biomass level to MSY, both from a Pella~Tomlinson model 
(Methods). We applied this model to a compilation of 234 fisheries 
stock assessments'’” for which we have reliable estimates of catch, 
biomass and MSY. For these well-studied fisheries, covering 20-25% 
of global catch, food provision is 86% (median), or 75% (weighted 
by maximum catch). Notably, food provision estimates from these 
fisheries are uncorrelated with the Halpern et al.' estimates (Fig. 2, 
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Food provision (Halpern catch proxy) 


Figure 2 | Food provision indices (percentages) estimated using the 
Halpern et al.' proxy, compared to food provision based on biomass data for 
234 fisheries comprising 20-25% of global catch (circles, see Methods for 
details). The two indices should be tightly correlated and fall on the 1:1 
‘expected’ line, but are uncorrelated (‘observed line, y = 0.015x + 73.1, 

R? = 0.0003). Grey histograms indicate the total number of fisheries at each 
value for the food provision values, with a peak at 0% for the Halpern et al.' 
proxy and at 100% for the index based on biomass data. 
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R= 0.0003). Furthermore, for fisheries with a food provision index 
of 0% from Halpern et al.’, actual food provision averages 73%. As the 
Halpern food provision index is uncorrelated with food provision 
estimates based on well-studied fisheries in many regions of the 
world", we see little value in applying it to data-poor fisheries on a 
country-by-country basis. 

The same rigorous method can be applied to Food and Agriculture 
Organization of the United Nations (FAO) status reports’®, which 
covered 70% of global catches in 2009. Re-sampling fisheries from 
FAO ranges for overexploited fisheries, fully exploited fisheries and 
underexploited fisheries’ and estimating food provision using the 
Pella~Tomlinson model yields a mean food provision index of 82% 
(95% CI, 81-84%). 

Thus, food provision from oceanic fisheries is consistently esti- 
mated to be in the range of 71-95% by a wide variety of methods, 
which is far from the 25% estimated by Halpern et al.'. In conclusion, 
Halpern et al.' believe that it must be possible to extract MSY and food 
provision estimates from a simple function of current catch and maxi- 
mum catch. Such approaches in other guises are similarly flawed’*’. 


Methods 


Sustainable yield from fisheries as a function of population size can be repre- 
sented by a Pella~Tomlinson curve'®. From this curve, food provision is the ratio 
of sustainable yield to MSY, and can be written as a function of current biomass 
(B) relative to biomass producing MSY (Bygy) plus a parameter (z) that governs 
the shape of the curve: 


food provision= 


zle-V) 1 1/(z—1) B l 1/(z—1) B Zz 
100 x max< 0, ——— | [ - ef ice 
z-1 & Busy Zz Busy 


Setting z = 1.188 ensures that Byysy occurs at 40% of unexploited biomass, which 
is the mean from 147 fisheries'*. This food provision index is 0% when biomass is 
zero or greater than 2.5 X Busy, and 100% when biomass equals Busy. 

This project was partly funded by Cooperative Agreement NA09O0AR4320075 
between the Joint Institute for Marine and Atmospheric Research (JIMAR) and 
the National Oceanic and Atmospheric Administration (NOAA). The views 
expressed herein are those of the authors and do not necessarily reflect the views 
of the NOAA or any of its subdivisions. 
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Halpern et al. reply 


REPLYING TO T. A. Branch, D. J. Hively & R. Hilborn. Nature 495, http://dx.doi.org/10.1038/nature11974 (2013) 


Branch et al.' suggest that the fisheries component of our ocean health 
and benefits index’ is ‘biased’. We contend that our approach is no 
more biased than their alternative, and that our method improves on 
theirs in three fundamental ways: it provides a score for every country, 
accounts for thousands of data-poor stocks, and incorporates sustain- 
ability more comprehensively. 

Our global score is an area-weighted country average of a multi- 
dimensional assessment of sustainable food provision, fundamentally 
different from what Branch et al." interpret it to be, which undermines 
comparisons to their global estimate. They incorrectly state that our 
index should be roughly equivalent to dividing global catch by global 
maximum sustainable yield (MSY), and incorrectly interpret a score 
of 100 in our index as B = Bysy. They focus on single-stock biomass 
harvested relative to single-stock MSY, as in their Fig. 1, and assume 
that they know true MSY values. We used a multispecies reference 
point (mMSY) with a precautionary buffer that penalizes countries 
both above and below this value, and our focus was on fisheries within 
exclusive economic zone (EEZ) scales. Branch et al.' focused on a 
single global assessment (of current status only), leading to a relatively 
small number of the largest fish stocks worldwide driving their inter- 
pretation. We chose to use catch data, available for all countries and 
most stocks, acknowledging the associated limitations and uncer- 
tainty of those data. Although their approach may be useful for assess- 
ments of global biomass production, and we agree it is preferable to 
use abundance estimates and other indicators rather than catch data 
when possible, reliance on such data precludes country-level assess- 
ments at the global scale. 

By focusing their analysis on a geographically restricted data set of 
exclusively large commercial species (<5% of all stocks in the world), 
they assume that small stocks, and the many interactions they have 
with large stocks, do not matter for understanding sustainability, and 
that the many data-poor stocks are irrelevant to understanding food 
provision. Data-poor stocks represent ~80% of all fisheries biomass 
and the vast majority of stocks in most countries. These stocks are in 
worse condition than assessed stocks, and getting worse’. Their selec- 
tion of tuna and billfish as a counter example to our results focuses on 
an outlier; these large pelagic stocks are not representative of most 
fisheries within EEZs (that is, bottom-dwelling species or those taken 
in small-scale fisheries) or most unassessed stocks. Therefore, it is 
reasonable to assume that unassessed stocks would lower fisheries 
index scores, and for our comprehensive assessment to differ from 
the narrower one of Branch et al.’ 

Because of these key differences, strong correlations between our 
methods should not necessarily be expected. In addition, assessments 


made using best available data in individual countries should generate 
different outcomes compared with a global assessment where the focus 
is on using globally available and comparable data. We know that many 
country-specific scores will change as we improve the underlying data 
and our ability to predict sustainable yield of stocks around the world. 
Our index was explicitly designed as a framework that can ingest the 
best scientific information as it becomes available, whether as better 
data or improved approaches to measuring a goal, and in fact, we are 
doing just that as we apply the index at regional scales. Authors from 
both papers are currently collaborating with the Food and Agriculture 
Organization of the United Nations to examine how to improve method- 
ologies for estimating fisheries exploitation status globally. We look 
forward to incorporating those results and improved data in future 
iterations of the index. 

This Reply is written by the subset of authors that developed the 
fisheries component of the ocean health and benefits index” relevant 
to the discussion in Branch et al.’. 
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Is the ocean food provision index biased? 


ARISING FROM B. S. Halpern et a/, Nature 488, 615-620 (2012) 


How close to maximum sustainable food provision is current seafood 
harvest from the world’s oceans? Halpern et al.' suggest that the 
answer is 25% from a global index of food provision, part of their 
multifaceted index of ocean health. Rigorous methods used for 
management, however, demonstrate that their food provision index 
is uncorrelated with actual food provision, and that global ocean food 
provision is in the range of 71-95%. Their results stem from an 
uncertain method of estimating maximum sustainable yield (MSY), 
and we believe that this approach should be avoided as a measure of 
food provision. There is a Reply to this Brief Communication Arising 
by Halpern, B. S. et al. Nature 495, http://dx.doi.org/10.1038/nature11975 
(2013). 

The first indication that the Halpern et al.' food provision index 
may be unreliable comes from examining the scores of individual 
countries whose fisheries performance is well understood. It is widely 
acknowledged that fisheries in New Zealand and the USA are well 
managed*” whereas those in Europe are not**. However, New Zealand 
has an index value of 16% and the USA 25%, whereas Europe includes 
four of the six highest values, topped by France (87%) and Spain (60%). 

We tested the reliability of their index in a number of ways. 
Crudely, their index should be about the same as dividing global 
catches by global MSY. Recent estimates of global MSY are 83-99 
million t (ref. 5), 89.5-100.9 million t (ref. 6) and 85-110 million t 
(ref. 7), which are similar to older estimates that are often rounded to 
100 million t (refs 8, 9). Dividing current catch of 78.6 million t (ref. 10) 
by MSY yields estimates for current food provision of 71-95%. 

The underlying basis for the Halpern et al.’ food provision index 
(setting aside modifications for trends, pressures and taxonomic 
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Figure 1 | Contours of the Halpern et al.' food provision index (0-100%) in 
relation to current catch and maximum catch ( C,,a,). The Halpern index is 
100% when catch is between 71% and 79% of assumed MSY, and 0% when 
catch is either zero or greater than 150% of assumed MSY. MSY is assumed to 
equal Cinax’”, based ona highly uncertain relation between MSY and Cynax. For 
large fisheries, the food provision index is zero whenever current catches are 
more than 50% of maximum catches. 


groupings) involves comparing current catches to estimated multi- 
species MSY as estimated from a regression of MSY against maximum 
catch (see Fig. 1 for details). This regression is highly uncertain: the 
ratio of their predicted MSY to actual MSY spans a 50-fold range (95% 
CI, 0.10-5.02). This is why MSY is normally estimated from biomass 
data using stock assessments. Stock assessment estimates can be used 
to ground-truth the Halpern et al. catch-based estimates of MSY. For 
example, when their method is applied to all assessed tuna and billfish 
fisheries, their predicted MSY is lower than assessment MSY values 
for the 29 (out of 36) largest fisheries, and summed predicted MSY is 
only 41% of summed biomass-estimated MSY. When these estimates 
are fed into their rule for calculating food provision (which is zero 
when catches exceed 1.5 X MSY), their food provision index for tuna 
and billfish species is 0%. By contrast, the MSY estimates from the 
assessments used to manage these tuna and billfish species result in an 
estimate of food provision of 94% (summed catches divided by 
summed MSY). 

For a more rigorous examination, we compared sustainable yield at 
a given biomass level to MSY, both from a Pella~Tomlinson model 
(Methods). We applied this model to a compilation of 234 fisheries 
stock assessments'’” for which we have reliable estimates of catch, 
biomass and MSY. For these well-studied fisheries, covering 20-25% 
of global catch, food provision is 86% (median), or 75% (weighted 
by maximum catch). Notably, food provision estimates from these 
fisheries are uncorrelated with the Halpern et al.' estimates (Fig. 2, 


TOPs S0° 
ow se 
eee ene eg 
@ ra } 
ee 
@°@ @- e@ 
ze e Opsatiee g Be oe) ell 
€ @,2-0----3-,--& 
bcd * = @ ee 
2 Oe be 2 
= 60- e @ e e ® - ® 
as 1° ks 
i= } e 7 @ 
Q @ ’e ® 
g ° *> 
3 405 e & oe e e 
Q e e7 ¥ } 
3 |° 8 J ° 
5 y, ® 
2? g e , e f 
20 ye @ 
ra 
7 ®@ e | 
e- 1 
7 
0-4_+—_+# 7 a 
0 20 40 60 80 100 50 


Food provision (Halpern catch proxy) 


Figure 2 | Food provision indices (percentages) estimated using the 
Halpern et al.' proxy, compared to food provision based on biomass data for 
234 fisheries comprising 20-25% of global catch (circles, see Methods for 
details). The two indices should be tightly correlated and fall on the 1:1 
‘expected’ line, but are uncorrelated (‘observed line, y = 0.015x + 73.1, 

R? = 0.0003). Grey histograms indicate the total number of fisheries at each 
value for the food provision values, with a peak at 0% for the Halpern et al.' 
proxy and at 100% for the index based on biomass data. 
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R= 0.0003). Furthermore, for fisheries with a food provision index 
of 0% from Halpern et al.’, actual food provision averages 73%. As the 
Halpern food provision index is uncorrelated with food provision 
estimates based on well-studied fisheries in many regions of the 
world", we see little value in applying it to data-poor fisheries on a 
country-by-country basis. 

The same rigorous method can be applied to Food and Agriculture 
Organization of the United Nations (FAO) status reports’®, which 
covered 70% of global catches in 2009. Re-sampling fisheries from 
FAO ranges for overexploited fisheries, fully exploited fisheries and 
underexploited fisheries’ and estimating food provision using the 
Pella~Tomlinson model yields a mean food provision index of 82% 
(95% CI, 81-84%). 

Thus, food provision from oceanic fisheries is consistently esti- 
mated to be in the range of 71-95% by a wide variety of methods, 
which is far from the 25% estimated by Halpern et al.'. In conclusion, 
Halpern et al.' believe that it must be possible to extract MSY and food 
provision estimates from a simple function of current catch and maxi- 
mum catch. Such approaches in other guises are similarly flawed’*’. 


Methods 


Sustainable yield from fisheries as a function of population size can be repre- 
sented by a Pella~Tomlinson curve'®. From this curve, food provision is the ratio 
of sustainable yield to MSY, and can be written as a function of current biomass 
(B) relative to biomass producing MSY (Bygy) plus a parameter (z) that governs 
the shape of the curve: 


food provision= 


zle-V) 1 1/(z—1) B l 1/(z—1) B Zz 
100 x max< 0, ——— | [ - ef ice 
z-1 & Busy Zz Busy 


Setting z = 1.188 ensures that Byysy occurs at 40% of unexploited biomass, which 
is the mean from 147 fisheries'*. This food provision index is 0% when biomass is 
zero or greater than 2.5 X Busy, and 100% when biomass equals Busy. 
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Halpern et al. reply 


REPLYING TO T. A. Branch, D. J. Hively & R. Hilborn. Nature 495, http://dx.doi.org/10.1038/nature11974 (2013) 


Branch et al.' suggest that the fisheries component of our ocean health 
and benefits index’ is ‘biased’. We contend that our approach is no 
more biased than their alternative, and that our method improves on 
theirs in three fundamental ways: it provides a score for every country, 
accounts for thousands of data-poor stocks, and incorporates sustain- 
ability more comprehensively. 

Our global score is an area-weighted country average of a multi- 
dimensional assessment of sustainable food provision, fundamentally 
different from what Branch et al." interpret it to be, which undermines 
comparisons to their global estimate. They incorrectly state that our 
index should be roughly equivalent to dividing global catch by global 
maximum sustainable yield (MSY), and incorrectly interpret a score 
of 100 in our index as B = Bysy. They focus on single-stock biomass 
harvested relative to single-stock MSY, as in their Fig. 1, and assume 
that they know true MSY values. We used a multispecies reference 
point (mMSY) with a precautionary buffer that penalizes countries 
both above and below this value, and our focus was on fisheries within 
exclusive economic zone (EEZ) scales. Branch et al.' focused on a 
single global assessment (of current status only), leading to a relatively 
small number of the largest fish stocks worldwide driving their inter- 
pretation. We chose to use catch data, available for all countries and 
most stocks, acknowledging the associated limitations and uncer- 
tainty of those data. Although their approach may be useful for assess- 
ments of global biomass production, and we agree it is preferable to 
use abundance estimates and other indicators rather than catch data 
when possible, reliance on such data precludes country-level assess- 
ments at the global scale. 

By focusing their analysis on a geographically restricted data set of 
exclusively large commercial species (<5% of all stocks in the world), 
they assume that small stocks, and the many interactions they have 
with large stocks, do not matter for understanding sustainability, and 
that the many data-poor stocks are irrelevant to understanding food 
provision. Data-poor stocks represent ~80% of all fisheries biomass 
and the vast majority of stocks in most countries. These stocks are in 
worse condition than assessed stocks, and getting worse’. Their selec- 
tion of tuna and billfish as a counter example to our results focuses on 
an outlier; these large pelagic stocks are not representative of most 
fisheries within EEZs (that is, bottom-dwelling species or those taken 
in small-scale fisheries) or most unassessed stocks. Therefore, it is 
reasonable to assume that unassessed stocks would lower fisheries 
index scores, and for our comprehensive assessment to differ from 
the narrower one of Branch et al.’ 

Because of these key differences, strong correlations between our 
methods should not necessarily be expected. In addition, assessments 


made using best available data in individual countries should generate 
different outcomes compared with a global assessment where the focus 
is on using globally available and comparable data. We know that many 
country-specific scores will change as we improve the underlying data 
and our ability to predict sustainable yield of stocks around the world. 
Our index was explicitly designed as a framework that can ingest the 
best scientific information as it becomes available, whether as better 
data or improved approaches to measuring a goal, and in fact, we are 
doing just that as we apply the index at regional scales. Authors from 
both papers are currently collaborating with the Food and Agriculture 
Organization of the United Nations to examine how to improve method- 
ologies for estimating fisheries exploitation status globally. We look 
forward to incorporating those results and improved data in future 
iterations of the index. 

This Reply is written by the subset of authors that developed the 
fisheries component of the ocean health and benefits index” relevant 
to the discussion in Branch et al.’. 
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